'03f07fll4Q(I)13|52j} 5St:B S K B 



rn'Mimmryhm, 



R;624 



P. 04 



60/012705 




Si 

.4 



PR0VISK3NAL PArSNTAPPUGAllON COVER SHEET 



VoLrath 



TTTUO^THCMVEKTlON^OhMamM^) 







SUN* 1 2toC» 




, U9 tVtW* f-M 


h - 


U-SX 


CNCLOSeD Affffl.lCATlOM PARTS /0M «r rtc YTT 




□ — — 



'03$07|14H(|)l3l|52i^ ||;B S K B 



R:624 



P. 05 



60/012705 



PNA MOLBCULCS ENCODING 
PL.<NT PROTOPORraVRINOGEN OXIDASE 
AND INKDITOR-RESL«TANT MUTANTS.THEREOF 



FIELD OF THE INVENTION 



\ The inveniion mUies lenenlly lo the pltni enzyme pfo co porphyrinogen oxidise 

I 

iio Orotox'). InpvticuUr.thtini^nikmrelmstoDNAinolecu^ 

I modifietL inKibttor-re&iflim fonm of this ciuymc The mveniiOA further nlaies to methods for 

*i tissue culQirc selection vkI heiteide iif^)ljc«t)on b«sed on these modined fonns. 
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BACKGROUND OF THE INVEf^ON 

L The PioUn Eazynir Hftd lu InvolnmctU In the CUoraphytl/HciiM Biosyiilfacac 
Ptthway 

The bioitynihctic fiuhways which lead to the pR^udion of ^torophyd ind heme share a 
number of commofi steps. Chlofophyll is a lighi harvesting pigmeiu present in aJI green 
photosynchcf ic ors^nisms. Heme ts a cofactor of hemoglobin, cytochromes, P430 miaed-funciion 
osy ccnases, peroxidases, and catalascs (ut, f .g. Uhninger» Btorhemi^ffv. Worth Publishers, 
New York (1975)). and is thentfore a necessary component for all ■erobic orginbm$. 

The Iftsi common step in ^kmphyll and heme biosynthesis is the oxidation of 
protoporphyrinogcn IX to protoporphyrin [X. Pmopoiphyrinogen Q%\dSiSc (refencd to herein as 
"protox") IS the enzymte which catalyzes diis last oxidation step (Maiiingc ri Biochsm. J. 260: 
23I(IW9». 

The proiox enzyme has been purified cither pantaJly or completely front a number of 
OTf anisnu including the yeast Sacckarcmycei cm^Uir (Lsbbe-Bois and Labbe, In Biosynthesis 
oLUmc^OllomphxUp E-H. Oailcy. cd. McCr«w Hill: New York. pp. 235-283 (1990)). barley 
ciloptasis (Jacobs and Jacobs. Biochemt J. 244: 219 ( 19B7)), and mouse liver (Dailey and Karr. 
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pcoidnU-pprnximidyll kI>c««l->oci«e. wiihtf»oeIl.i«nibi»e. Tl-Ai-Wtopwein- 
Si kDti, uid is t solubkt qrtopUonik activity 

P^x encoding |en« ta« iww risa been i»b«ed (h-n 1^ 

ffitfl. CAml 27<)r«;; W76-M80 (1995) and p\$M (fcttmrtiPMl tpplkatioB i». 
PCr/IB95A»«52 ritodJuneS. l995.p*IishBdDcc.2l, I99S»W09»34659). 



IDe Pram Gene u ■ HvUdde Tu^ 
The ui« rfheiWcides wcondol wwtairiWe veget^ 
h,,l«o«*.«mc,u.«ni^pr«tkc. Ttenaeventn«tae^^ 
Despite this «oensivc itfe. ^ eonool fBiaim e lignincmi »d ««ily preWem to finnen. 

Effective u«ofh«bk:«fciitquii««ogndm««geinenL For iwunce, dm: end method of 
sppticiiion end itigeof weed pl«tdevdopi>ie« siecritkal to getting good weed eofUiol 
tefWcidei. Sinoe Vinous N^<«ie.eie«ii»ntloheAk:ide*,iheproduc^ 

tartMCides beeoRWK iameshigly impflttim. 

Unfomttteldy. hefbiddes lh« cxWbU g««er potency. 
ripid degndeiion in .dl c« 4l« hive gie-er eiop phyioioxidty. One lelution spplied to this 
problem has been lo develop erop. which «t i«i«-tt or lokc«t lo he Aicldes. Oap hybrids or 
v„i«ics ie«s«anl u> the hcrt»cides •How forihe use of the heiblcid« without attendsnt ri<k of 
di„«^to«he«op.De-.lopi»«.iofi«i«snoecw JJow,pp«c«ionof sheibieide toe crop 
wheie iu uie w» p«viously precluded or limited (e.f . to prt-cmeigence u.e) due to icnritivity 
of Uie crop to the herbicide. Foreuinple.U5.Pwent No. 4.761 j73to Andenoneratb 
directed 10 pUnuieiijtiM to viriou*iinidMO>inonecriulfOB«iiideheit«^ Tbe tesijunee is 
We«dby«..Ii«td.cetohydro.y«:idiynlh«.(AHAS)eBzy,«c UJ!.P«e«No. 4.975J74 
,0 Goodm« « letaies to pUn. cili ««1 pUntt conlsifting » gene encoding • tnul«i gluumine 
lynthcute (OS) twiitam to inhibition by be(tM:ide> thai wm known lo inhiWi CS. e g. 
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piKMphiiiattiricin and fnethkmine sulfoxhrnnc U^. Psient No. Sjai3j659 to Bcdbrook <r a/, is 
dirocted to phnu ihti exptss » muttni KCtolww lymhtM which ittideri the pl«m> i«stscanm 
tflhibftwn by sulfonyIuR» herbicides. U.S. P«cm Na S«l62j6Q2 lo Somers et al. discloics plants 
lolenmtoifthibitioAbycydohexjnedimandaiyl^ The 
loIeniKe is confcmd by an aJiered ace^l coensyme K c«rboxylas«( AOCaie). 

Thepmoxenzymtf^i^ihetaiiges/ora virkiy ofhertk^^ The 
herbteidcs (hiit inhibit procox inchide miny difTetcni stmctunl cU&scs of moiecuks (Duke €i aL, 
W€€dSci,S9i46S(mir. Stna^ii€taL,PafiddtB:ochem.PhynoL43: 193(1992); 
Matrinie ei oL, FSBS UtL 245: 25 (I9S9}; Yanatt and Andoh, Petticide Bkichm. Fk^ SSi 
70 (1989)). Tlttse heibieidai oompounds tnelode the dipbcnylethen {e^« ■dnuorfen, 5-(2- 
ehllm^triflaotQlnethyl)pheAoxy^2-mtlobe^ add; its methyl ester, or oxyfhiorfau 2<hlce^ 
l-{3-ethoxyU-iiitrophettoxy)^trinui]niber^ oxidiazoles« (e.g« oxidiazoiu 3*PL4-dkhlon>- 
Ml-fnelhylethoxy)phenylMKl J-dinwihyicthy2)*U^xa^^ cyclic Unidcs (&f . 

5«23 1 41 y^K4-ehloco*2-nuorD*5-pixiparKy lc»ypbeflyl)-3,4 J.6He»^^ 
dilorophihaiinu VK4<hloffopheny()-3,4^ii-ienhydiophthi2imk^ phenyl pymolcs (e.g. TNPP- 
ethyl. cihyl 2-IH2.3.4Hn^brophenyl>4-mcropyrazolyh5«oxy]pfopkmai^ MftB 39279), 
pyridine derivaiives (e.g, LS S2*556)» and phovopyiaie and its Ofhenylpynolidino- and 
pi peridinocafbamate sralogs. Many of these compounds oompeutivcly inhibit die nonnal leactioa 
caulyzedbyihecnTymcappaienily acting as tubume analogs. 

Typically, the inhibitory efTect on protox is determined by measuring nuoiescenee at about 
622 IQ 635 nm. afier cxciuiion at about 395 to 410 nM («ec. . Jpcobs and Jacobs* &tyom 28: 
206 (1982}; Sherman eroL Plant Physiol 97: 280 (1991)). This assay Is bused on the /act chat 
protoporphynn tX ie a fluorescent pigment and pro oporphyrinogen EX is nonfluoces^nt. 

The prtdfcted mode of action of prtKox-inhlbiting herbicides hvolvei the aocumulalion of 
profoporphyrinogen IX in ihe chloroplnst This accumulation is thought to lead to leakage of 
pnKopacphyrinogen IX imo the cyiosol where it is oxidiaed by a peroxidase activity to 
piocoporphyrin IX. When exposed lo iighu protoporphyrin DC can cause fonnatioo of itnglec 
oxygen in the cyioiol. This linglct oxygen can in turn lead to (he formation of other reactive 
oxygen (peciea. which can cau^ic lipid peroxidation md mcmbrvie disruption leading to rapid oetl 
death (Ut ei oL. PtM Pky$iaL 102: 88 1 ( 1993)), 
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Not procoxcn^m»ifcsenMdvekiherbiddtt whkfai^^ 
Bochof thepmoxcn^yinescrcodedby geaesisol«tBdft^ dL, 
Cvt / l^ienMaL 39: < 155 (1993)) ind BacOius MuhtUis (Diiky «f at, / BiaL Chm. 259: 813 
(1994)) ire ftslstsm to these heibicsdiUmhibte^ Inad^tkxunwuiitsof cheontenularalga 
Qdamydomonas nuOiofdtut^AstmfP^^pmySm^ Iwvt been itported 

(lUuoki ef oL. / Pef^£idt ScL IS: 449 ( 1990); ShibaU et In ftasgyA in P^orowntf^ifi. 
VoL DL N, Mvmi, cd. KlQwerKethertuxte. pp. 567-570 ( t992)). M leM one ef riiese mutinu 
ippem to have in pi^toi acti viiy that IS resisum no^ 

which the mutuu w«s fcleeied, bu also lo other elisses of preton Hihibiton (Oshio «f «t» Z 
Satwfonck 4Sc: 339 (1993); Sato el at* In Sympftaium on parphvric ftarirideg, 5* Duke, 
ed. ACS Press: Washin{iOA,D.C( 1994)). A mutam tobacco ccllline his «i»o been Rpoited that 
is resMtim U> the inhibitor S-21432 (Qie ci Z Namtfonck, 48c: 350 (1993). 



SUMMARY OF THE INVENHON 

The picsem invention provides isolated DN A mokcules and tAimetk genes encoding (he 
procopofphyrinot«n oxidnse (pioiOJt) cnryme rram soybean end whesL The sequence of soeh 
ifiolstedONA molecules ait let forth in SEQ ID Noft. 1 1 (soybean) and 9 (wheat). 

The pfwm invention also provides modirted formsof the plant pfotoporphyrtnogen 
oaidasc (pitxox) eatytnc which are tesistant to connpounds that inhibit unmodtried naturally 
occumng plant protox entymes, and DNA molecules coding Tor such inhtbiior-fesistam plant 
pfuox en^^mes. The present invention includes chimeric genes and modified forms of naturally 
occunnng proioi genes which can cipfCis the inhibttor-iesictant plant pmtos enzymes in plants. 

Genes encoding inhibitor-resistant plant protox enzymes can be used to confer resistance 
10 proioi-inhtbiuify herbicides in whole planu and as a selectable nuiter in plant cell 
trans/ormatton methods, Aoeordingly. the present invention also includes plants, plant ttsuces and 
"pliAi «eeds coniainiflg rnant capressikMc genes encoding these modiricd pcotoa enzymes. Thei£ 
plants, plant tissues plant seeds are ivsisiint lo pratox-lnhibitors ai levcU which normally are 
inhibitory lo the naiuraily oocuning protos vtlvity in (he pUnL Plants OKompassed t»y the 
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invcfitkm eqiecially bchxk those i^tkh would be pocendtl cicyets forproioz nhibitini 
hetbidte. pMicuUriy iStOAomkally 

tt wfaeittom. rye, scxghum. rice, tertey. makcturfandron^fFisse^aAdthclifce^iawdltt 
00000, ni^cnc, nfiir beet, oilseed npc, ntd soorlnai. 

Thepr«£ciit inveodoflistfTectolfufihertoinetbodsfortheprodw 
tissues, lod pUat seeds iMhicb produee an inhibiior-restsuni ferm of the plant pitNox enzyme 
pRivided bottn. Such pUnts may be stably if insfm i n e <J witti a mvcooTl fcne eticodinf the 
resistant pratoiu or picpared by diiect setetkmieehtiiques wherAy hobickfercsisumUnesm 
isolaiad chncterixed and developed. 

The presettt invention is further df iMed co prabcs and methods for detoctint dK p res enc e 
of genes cncddiAf inhibiiDM«ustam foitM of the pUnt pratox ersyme and quantitating kiMts of 
rnhibilor-ieusiani praiox tnnseripts in plant tissue. These methods may be used u> ideatify cr 
fcroen for plants or plant tissue oonuining and/er eapcessint * tf^ cneoding an inhibitor- 
resistam form of the plant piotox estynie. 
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DESaOPn NOP THE SEQUENCE U^nNC 
SEQIDNo. 1: DNA coding aequenoc for m A r Mdop s is AaUatia pxff>xA prottin, 
5EQIDN0.2: Arobidt^sis thaliMa pfOt^x*] amino acid stquciw encoded by SEQ ID I^. 
t. 

SEQ D No. 3: DNA coding sc«}t^nce for an Arabidofm ihaliana protox-2 praicin. 
SEQ ID No* 4: Anifidopsis ihaiiM pMo%'2 amino acid scqoenoe encoded by SEQ ID 
NoJ 

SEQIDNo.5: DNA coding sequence for a maiicpiotoi- 1 pnxein. 

SEQ ID No, 6: Maiae pmoi^l amino acid Mquenee encoded by SEQ ID No* S 

SEQ ID No. 7: ON A coding uquence for a maize pfDCoK-2 procenu 

SEQ ID No. 8: Maize pracoK-2 amino ackl sequence encoded by SEQ ID No. 7 

SEQ ID No. 9: DNA co^g sequence for a wheal protox- ) peoCetn. 

SEQ ID No, 10: Wheat procox-l amino acid sequence encoded by SEQ ID No. 9. 

SEQ ID No. n : DNA coding lequenoe for a soybean pnKox-l protein. 

SEQIDNo. 12: Soybean proiox- 1 protein encoded by SEQ ID Na 11. 

SEQ ID NO' 13: Promoter sequence (romAmhidopsis thaliana proioxO gene. 

DFTAILED DESCRUTION OF THE INVENTION 

precox Codini Sequences 
In one aspect, the present invention is direcied to an isoiaied DNA mokcule which 
encodes pfocopofphyrioogcn oxidase (refeffcd to herein as '>t«Ox'*). *c enzyme which caialyxes 
the oxidation of ptoioporphyrinogen IX lo protopoiphyrin DC. from soybean and wheat. TTie 
DNA codmg lequenee and corresponding amino acid sequence for a soybean protox enzyme is 
^ pn>vided as SEQ ID Nox. 11 and li rcspeaively . TIk DNA coding sequence and conr^wndirg 
amino Kid ftcquei^ for a wheat pratox enzyme is provided as SEQ ID Nos* 9 and lOi 
fCtpectivcly. 

DNA coding fcquenco and cortesponding amino acid sequences for proiox enzymes 
from At^idnpiis ihaliofUM and maize which have been previously isoUicd are refModuced hemn 
SEQ ID Nos. 1 *4 (AfMdopsis) and SEQ ID Nos 5^ (maixe). 
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the isobxed euUiyotk piwix icqi«v^ 
flu&ipuliced Mcording 10 itifidinS gnwt^ 
Fgr exinvte; ihe cfttife pfotox icqiicrtce Of ^ 

ipoaficrilyhybodixtngtoprowxcotfiof To achieve spectfte 

hybridizitkm v«w^ t vtfkty of coodit^ 

proCQx coding sequettts tnd «re prcfenbly it kjox 10 luiclcoCides m length, and mee: prcfekibly 
atleftfl20nuclooiidesinlenftti. Suchpiobesnuy bcuscdiotinplify tndaniiytepiocoxc^ 
•6qiieMcsfnmi«diaacnartaAiSfnvU(hew^^ polymerwc duun leactioo 

(PCR). Thisiedmi^i^Auy be useful to Uolaie addiikioil pfolox coding sequence* ftom a dciirf 
ofgaAum or uidiigoosdc«»tty ID determine the presCM of coding sequences in en 
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spedfic hybfftdizaiton prabes nwy Also it^ 
caAafyeiie protOK |pei«{s) in ihe genome of i choaen orgsnism using sundanJ lechmqua bued 
on the selective hybridizauon of probe to genomic piotoi se qu ences. These techniques 
include tNit ve noc Umtted to. idemiTioition of DNA polymerpbims idemified or eontained 
within the proiox probe tcguencc. and use of 5iw;h polymoiphisma lO follow segivgaueA of the 
pfo(0« gene relative to otter markers cf known map position m a mapping populatitA dmved 
from self feiiitization of a hybrid of two polymorphic paiemal liihes (see cj. Helen^aris ts aL, 
PUua MoL Bid. 5: 109(1983). Sonuner^rol SioMvMgiiei 72:82 (1992). Dt>vidior^ 
FlM MoL Biol 15: 169 (1990)). While any eukaryotic pratoa scqwenee is oontemptaied 10 be 
useful at a probe for mapping piotni ^enes from any eukaiyocic organism. preferT«d probes are 
Cho<«« protox sequences from organisms more cto>eIy itlateO io the chosen organism, and most 
prererred probes arc those proton tequenccs from the chown organism. Mapping of ptotox genes 
in ihii manner it CMiemplaied lo be psnicularly useful in planu for breeding purposes. For 
inOAiM. by knowing the genetic map position of a mutant proiok gene thai confers herbicide 
rtsistmcs. nanking DNA marken can be ideroified from a reference genetic map (soe« e.g.. 
Heknijaris. Tmi^ Cfurf. i: 2f 7 (19^7)). During introgression of the herbicide rcsisunce trail 
into a new breeding line. th»e marken can then be uud lo monitor the extern of proc^a-linked 
banking chroriMMomal DNA still pfeient in the iBcuncnt parent alter each round of hack-crossing. 
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Pmw spedfc hybridizukin probes nvy «Uo he vced to quamim ImU of procox 
mRNA in m ^rfuusm usifif suuidivtf tedviiques such as Korthem bloc Mialysis. TUs icchiuqiie 
mty be uieftiltsi diagnostic issay todetcet4llei«dkvei$ofpct<oxexpfastonih«maybe 
anccttffff widi paitkiitar advose c^dons such as autosomal donrunant disorder in homans 
«haractcfiz«d by both ncuiopaychiatne tymptoim and skin toions, have dccreaaad levels of 
pnxoK •ctivicy (Brenner and Bloomef « New EngL h Med. 3Q2i 765 ( 1 980)). 

ForieoombiaamproduetiOAoriheeAzyniem coding aequenoe 

may be inserted into ai^ expresaioo cassette ddigned for thechoaen hosi and IncrodiMed tiuo the 
boll when ii is leooinbinantkypioduced. Thcehoiocof speoiriciegutaiarysapKAeessichis 
pramour, dgnai lequcnoe, 5' and 3* umnnslatcd sequences, and enhancer, is within the kvd of 
skill oTtherowtiMer in the aic llieitsttliaM molecule, conuimng the cAdiWduaieteiii^ 
in proper leading frame, may be ituetted into a vector capable of being bansfonned into die host 
cell. Suitable expresskMi vecion and methods for recombinani productjon of proteins ate well 
known for host organisms such as £ coli (sec, €.g. Studierand Moffait, / Mm. Bid, J89: 1 13 
(1986); Brositis, DNA 8: 759 ()989))» yeast (sec, e.^. Schnekkr and Ctiarente. MeUu £mymoL 
m: 373 ( 1 991)) and insect cells (see, f.g.. Luckow and Surmrm. Buj^tthnoL 6: 47 (1988)). 
Specific examples include plasnuds such as pBluescript <5traiagene« La JolU, CA), pFLAC 
(tntemaiional Biotechnologies, Inc., New Haven, CT). pTrcHis (Invitrogen, La JolU,CA), and 
bacufovtrus eaprvsiion vectors, e.g., those derived from the genome of Autagraphica ctiiifomica 
nucieer potyhedrows vtrvx {AcUSPV). A prefened hacuioyiiui/inscct system is pVll 1392/Sf2l 
cells (Invitrogen* U Jolla, CA). 

Recombinanily pnsduced eukaryotic proiox enzyme is useful for a variety of purposes. 
For exunple. it may be used to supply pfotox enzymatic acliviry m viiro. It may also be used in 
an in vrr^n assay to saeen known heibicidal dienvicals whose target has not been identified to 
^detemiine if they inhibit pfntox. Such an ui viiro assay may atui be used ai a more Eenenl fcmen 
to identify chemicals which inhibit proiOk activity and which are theiefote herbicide eandidatcft. 
Rcoombinanily produced euksryocic protoc enjiyme may also be used in an a»say to Identify 
inhibiuir-ffesiaant pnxox mutants {tee International application no, PCTyiB9SAtt4S2 fifed iune 8, 
1 995. published Dec. 2 1 , 1993 as WO 9S/34639* incorporated by reference herein in its entirety). 
Aitemaicvely, necombinamly produoed protox enzyme may be used to ftuther charadcrixe iu 
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«uoei«lioA wtth known tnhibhon in onier to ntiorolly desi^ new inhibitory beiMcides is well as 
herbicide toiertnt forms of the enzyme. 

Inhibitor Resigunt PlM PrMz Enzymes 

{ninother»«pect,te present invention ie«c))es iiirpJe modirications which can be mide 
10 Che Amino icid lequenoe ct tiny plant pn K op oi ph y rinogcn oxidase (nfcrred lo herein $s 
^'pnxoJt '} enzyme Co yieU an inhibitorHnesistAni form of this enzyme. 

The present invention u dirtcud to inhihttor^icsisant pluit protox enzymes hiving the 
modifiCAtJons taught herein, and to DN A molectiles encoding these modified enzymes* and to 
genes cipabk of eaprn&ing these modified enzymes in plants. 

The present invcmion is fuither directed lo plants, plant tissue and seeds tolcrvK to 
heiteito thit inhibit the natoraDy occuning prates activity in these plants;, wherein the toknuiee 
is confentd ty a ger^ expressing a modified inhibiior-icsistant procox enzyme as taught heicin. 
Representative pUnu include any plants to which these herbicides may be applied for their 
normally intended purpose, Picfciml are agfonomicaily important Cieps* i^., sngtosperms and 
gymnofpermi such a^ cotton, soybean, npe, sugar beet maiz£. rice, wheat, barley, oats, rye, 
iorghum. millets turf, forage^ lurf grasses and the lil^. 

. The rnodifiedinhibiiDr-fesistantprotox enzymes of the invention have aile^ 
KTid substitution, addition or <klction rttative to their naturally occurring coumerpan (i.c 
inhibiior^scnsitivc forms which occur naturally in a pUni without being manipulated, either 
duecUy via recombinant DNA methodology or indiieetly wd selective breeding, etc.. by man^ 
Amino acid positions wttich may be modified \o yield an inhibitor-resistant form of the proton 
cnty me. or enhance inhibitor le^isiance* aie indicaied in bold type in Table I in (he eoniext of 
plant procoi-l ^uenccik from Arttbidapsis, maiac, soyt)ean and wheat Iht skilled artisan will 
appreciiie that equivaleni changes may be made to any plant proiox gene having a scnictuit 
wifTicicmly similar the pnnoa enzyme lequenoes shown herein in allow alignment and 
idemificaiion of thoic an\ino acids which ait modified aocording to tlic invention to generate 
inhibiuM-Rviuani forms of the enzyme. Such additional plam protoa genes may be obtained using 
standard icchmquck a» described in Intcmaiional application no. PCT/IB95/D0432 fikd June i. 
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DNA niDieculcs encoding (he heibicide fcsistim ptoio& aodinf atcyenocs uuthr hmtii 
mky be gcMictlly enpneeted for opiifnil cxpreSMn in icrap pUnL Thts nwy lAciude xtaerinf 
5 ihk coding wquenoe of the restsunce allele for opiimil cxpicsuoA in the crap species of iiNcivsst, 
N^cthods for modifying coding ieqiMoes lo achieve opiinOiAl expnsion in » puticula- crap 
i||fieies ire weU known (lee, e.f. FtrUk ti cl., Pmc NatL Acad SeL USA U: 3324 (^991); 
Kjasiel €t oL, Bioh^chioL II: 194 (1993)). 

GeMJcally «ngifieernig a pracox coding sequence for optimal enpreuion mny alto inchide 
10 o^enbly Unking the nppi«priiie icgutatoiy lequenM (ix. pvMMcer. signai Mo^nenoe, 
tr^tffetcriptionil ttfmineion). EuraptesofptomoicncapMeof fimcikimAgtnpliiiuo^ 

(i)c.> choic cipabte of driving cxprewon of ihe ntincatcd mevml gcnei mch ns prooa in plane 

I 

cdls) include the eauliflow sr mouie vims (CaMV) 19$ or 355 pnamoiers and CaMV douMe 
pmnoien; nopaline lynlhase pronioieri; palhotenesi»-icUiied (PR) pmin promoievt; imall 
«4^bKUiit of ribulow: bisphoaphate cartoaylaie (sMiRUBtSCO) pramom. and tfie like. Ficliemd 

p^DfUoieri wilt be choic which confer high levd oonstinitivc expression Or. mofc ptcfoaUy. dwne 

I 

«|hftch confer ipccific high knrel expRSftion in ihe vmm% mwepcibfe lo damage by the herbicide. 
P^erivdpitmoienaie the hoe actm promoter (McE^ l4ol,Cim.Cfn€UlSi\ 150 
( j99l )n maue ubiquiun pfomoier (EP 0 343 9:6; Ta>ior at. flam CWi R^p. 12: 491 ( 1993)). 
ahd the Pir-I promoter from lobaccoM'ui^ufiejprii. or maize (J.5. Puem Appltcaxkm Serial 
ho.0S/18U7l ioRyals^rai.,mcorpQraiedby refcftncchcfeininiucminty). TheprDcnoien 
I) enMclvet nuy be modified to manipulaie prompter itftngth lo inc<«aie pnitox expresaion. in 
af^ordancc with an^recognized procedures. 

j The mvenioTk have aiko diKOvcrvd thai uiother pftfened pmnoter for ukc with ihe 
2« t^tb«(or-rvu4unt pmoi eodmg kcquencet i% the pnmoter associated with the native pituQX gene 
(jx. the pratox profiioicn srr copending, c^hiwi^ VS. Provt»kma) Apf^ication entitled 
'pmmaen from Protoporphynnogen Oxidai« Geii»'. filed on the uune day a^ the pmem 
abpltcAiAon and incorporated by refeience herein in »ih entiftfy.), The pratox promoter Mquenue 
from an Arahidfipsts gene U lei forth in SEQ ID No. 1 3. 
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Since the procox pramocer itself is suitabk for expvttuoo of inhibitQr^evtttim pvctD& 
coding; i«qiieACCK, the modiricitionstMg^ hctem may made diiectly on the miive procox 
preaem IP die plant ceil genome wiihoot the need to co^^ 
icguUiofy sequences. Sochmodifieationftcanbemade viidiiectodmuti(^ 
s as homologous reoomWnickA and selected fc)rb««edm 

ptenocype(itf«, e.«. Example la PazkowOuef oil. 7; 4021-4026 (l^X and U.S. 

Fwm Na S.487.99X paftkailarty colimau 1^-19 and Example 8). An added advanoge of this 
appnwch i& that beside containing the native pfotoic promoter, the rewltifig modified gene will 
aUo inchide any other legulatory elemems« such as signal or transii p 

10 wiiidiarepaitof thenaii^gene. 

Signal or transit peptides may be fused to the protoi coding aequeaoe in chimeric DNA 
eonstnicij of the invention to dixMtran^ort of the expressed protoxcnaQfine to the desir^ 
of action. Er.amptes of signal peptides include those natively Ihiked to the pUm 
pathogenesifr-tvtated proteins, e.^- M-l. PR-Z and the like. See* e.g., er oL, Plant MoL 

15 BioL / /:ft9^94 (1988). Examples of transit peptides include the chloroplast iransit peptides such 
as those described in Von Heijne et oL Plant MoL BhL »tp. 9;l04-t26 (1991); MtmtiaL, 
PLmi PhysioL 85: 11 10 ( 1987); Vom er oL, Ckne 65: 59 (1988), and mitochondrial tranNi 
peptides such as those described in Bouuy et al,, Natim J2ft 340-342 ;1967). Chiotoplasi and 
miioehondrial transit peptides aie contemplated to be paittculariy usefttl with the present 

30 invention as proioi enzymatic activity typically occurs within the mitochondria and chtorapiast 
Most pre fened for use are chloniplsst transit peptides as inhibiiion of the piotox enzymatic 
activii> in the chloniplasu is coniemplaied to be ihe primary bavs for the action of proiox- 
iAhibiting herbicicks (Wiikowski and Hailing. Phnr Physiol. 87: 632 (1988); Lehnen ei aL 

^ Pestic. Biochem, Physiol 37: 239 (1990): Duke «r aL. WeedScL S9: 465 (l99i)X Also included 

35 are sequences thai result in locAlizaiion of the encoded protein to various cellular compartmenu 
such as vacuole. See. for exampU, Neuhaus tt ^L, Proc. Natl Acadi ScL USA 88: 10362- 
- 10366(1991) and Chrispcels, Ann, Rtv, Plant Physiol, Plant MoL BiaL 42:21-53(1991), The 
lelevaiu disclosures of these publicaiioits an: incorporated herein by refeienot in their entirely. 
Chimeric DNA construcUs) of Ihe invention may contain multiple copies of a promoter <r 

30 multiple copies of the protoa structural genes. In addition, the eonstructts) may include coding 
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•cqveDca for miri»fs and codhig aequenoes far 0^ 

eidk in proper readmffimip with 0ie other fi^^ TTk 
prepantkin of sucii oonsmicis «e within the 

Useful twkcn MiKle peptides provk^ 
S forexaiT^»lc«resisttneeU>hyeniniydA,k^^ . 
g1yphoute,phosphifKXhnciii,orthelike. Ttacmvkerscan beusedtoietflctoelticrimfomie^ 
wtththechixnericONAoonsmiasoftbehnwnii^ Othcriudul 
iniffcen aic pcpiklie oaymci whkA cin be evsi^ 
edtor itictMNi, for cxtmple lucifenfe. fl*f Incuiofudue. or B^ictiottdwr* 
10 Theimm^nimDNAmkcuksoftiKin^'eiktm 

number of vt-tccpgnizedwuys. Thoie slulkd in the in wiU «|spreciete ihit the choice 
rmght depend on the type of plant, U.inonocflc or diooc,t«fseted for ir^^ Suitable 
methods of transfonning plant cells include microinjeetkm (Crossway tfi oL, BioTecM^y^ 
^:320-334 (1986)).«lectropontian (Riggs ^di Proc Natl Acad SU V5A AJ:S6Q2-S606 
l& Agrohacurium mediaied transformation (Hincbee el at» BiMdiHoU>gy 6i9\5-92\ 

(19SS)). diMt gene tnnsferlPaszkowski ef oX^ EMBOJ. J;27 17^2722 (I9S4)X holistic panicle 
Mcekration using devices available 6om Agraoetus, Inc^ Madoon. Wisconsin and Diqxmt. Inc^ 
Wilmington, Delaware (jteJormmpU, Sanford U.S. Patent 4.M5,050; and McCabe et 
aL, Biotechnology 6^-923-926 ( 1 and protoplast tnnsfofituiion/regenemlon methods (jvr 
20 U^. Patent No. 5 iswwd Sepc 27. 1994 lo Ciha^igy Corp.)- Also see, Weissinger tt 

aL Annual Aev. Cenei. 22:42M77 (1988); SMnfofdetaL, Par^ulaie Science and Technoiagy 
5:27-37 ( l987Konion): Christou et at,. Plant PhysM «7:67M74 (1988)(»oybean); McCabe ct 
oL^ Bia/Tcchnohgy 6:923*926 ( t988Kfioybean): Dana «f at, Bia^echnatagy 8:736-740 
(l990Knc«): Klein ei aL, Proc. NaiL Acad ScL USA, 55:4305-4309 (l988KRttizc): iOeifi ttoL. 
Biorftchnohgy 6:559*563 (l988Xmaae); Klein aL Plani Physiol 9/:44DU44 (l9BBXmai2e); 
Fromm €t al,. Bie/Tcchnotogy 0:833-839 (1990); and Gotdon-Kimm et aL. Plant Cetf 2:603-618 
(1990Km*iic). 

Where a hert>icidc r»^istim pvocoK allele ts obulned via difocted mutation of the nuive 
gene m a cmp plant or plant cell cultuic Troni which a crop plant can be regeneraied. it may be 
moved lAio commefcial vaheiies using mditiQAal biwding techniques to develop a herbicide 
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j (Dkfimciopwitbcwt the need for imtk^ 

I mmffu ii iu ng h ima ttie plinL Aliemttively, the herbicide mbtims^ 

I 9eMise«J)y engineered for optimal ezpieftuoe and then Uinsfonned iiMO the desired vviety. 

I Genet encoding >teedprotoxitsi»ttrtie» 

I 5 merken in ptaieeUtrsnfonntton methods. For exini^ plans, plun tissue or pUntcdb 
I tpinsromied with mtr«A$tc*^cin also be cmsforvnedwi^ 

I c^lecrbeing expiessedbythepUuiL The thus-«nn«rcmed cells tie timferred to nnfim 
I comiining the protoiaihibitor wherein only the Piocox inhSMon 

contemplitedtobepirtkderiyusefulisseieeti^ (c^, icifluorfen, 

}e S-P-ehkm-^KtrifbrniMthyl)^^ acid; its methyl ester, or o^uorfieti. 2- 

chkM^K3<dkoiy^4HiitioplmoiyH-(irini»ev^^ oxidiirnto, (e-g. oudiiMU 3-{2> 
dichlofo-$K 1 -mcthy tedwty )phcnyl W 1 J-diii«diyk;diyl^ A4-oaadUiol-2'<3IO-<»e), qfdic 
imidci (e^. S-23142* AK4<hloro-2-fluoity5^repargyloxyphenyl^ 

dikmphthalim. N^4<hk)iDpheflyI>3,4 J>u«ihydro^^ phenyl pynzoles (eg. TOPP- 
1$ ediyi, ethyl 2^ i-aJ^4-tnchiorOf)heny))^'tiDpyr«^ MftB 39Z79), 

pyridine deri vuives (e.g. LS il456), and phenopyliie and to Oiihenylpynolidinth and 
ptperidinoearbamate analogs. The method teapplkiable to any plam ceil capable of bemg 
transfonmed with an alieied pratoa-encedirtg gene, and can be used widi any transgene of imerest 
Expression of the tnnsgene and ihe pmoiL gene can be driven by (he same pfOtn^ 
30 on plant cetU« or by scpnte pnMnotcti. 

Modified tnhtbitor-resisiant protox enzymes of the pieaem invemion an leiisiant to 
herbieicfes that inhibit the natunlly oeeufTing piotoa activity. The herbicides that inhibit protox 
include nuny diffenni stnicwrai clwes of molecules (Duke el Wted ScL i9: 465 ( 1991 ); 
I NoAdihaili tta!,. Pesticide Bioehem. Pkyticl 43: 193 (1992); Mairinge €t 0L, FEES Un. 245i 35 
I 7% (1989);YanaseardAndoh,FMirAi^Aibc4rm./'Ay«i^ 

I dip^ytethcrv (e.g. adnuorifen. 3-[2<hlocD>4^tnnuoroncthyl)phenoay]«2-nttiobczoic add; its 
I raihyi ester or oxyfluorfen, 2<hlorc>-K3-ctho)iy-4-nitiophcnoxy)^trinuorobensen^ 
I . O)iidiazole& (e.g. o&idia2on. 3-(2.4-dk:hUiio-5-(l-methylcihoay)phenyl]-5^1 J-diinediyleU) 
U.4-oiudiaz£>i-2^3//>on!C), cyclic imides (e,g, S-23142. A^i^hloto-Z.fluuro-^- 
30 propargyloxyphenylh3i4 J,6-lctnhydfophthalimi^: chloiDphthalim. ^K4<hloropheny]>^^»4^«6- 
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30 



nhiP|7i«2^y^^SH>xy]pi«piofiiie; M&B 39Z79), pyfidioe derivatim (e^ LS S2-556)* mid 
Ite dipbci^kthm of pttiicultf jd^iifk^^ 




(FonnuUI) 



wtKfciB R equik -OOCWa (Fommlt II). -COMiSC^ 

(Fonmilt rV; tee Miigftn cr oL, Brighstm Crop Prot^ctim Omfewence^Weedt: 47-S I (1999)). 



Additkmal diplMy kshcn of inierest ^ 

^CHiCOOCHi 
CCHiOCH, 

(ftrniuU iVa; Hvyishi «r ol, Otv PrMeetiom Cetifertnce-Wtedx: 53*58 (1989)). 

An ttttfitional diphenykdia' of inicftst if OIK hs^^ 

pooc^ 

Ci- 



CI 



(FonnuU rVb; birenox. ue Dcsi dJL, P/oc. Nonluatt WeedSd- Coiif. 27: 31 (1973)). . 
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mi 



wbeiBtDQequili 



O OR 
(FonnuliVI) 



N- OR 0 OR 

(FonnoUVII) (Fgnnala VHD 



lie 



OR 



CHi 
(FormuklX*) 



OR 



CM, 
{FomwUDCb) 



(RonBuUK) 



IS 



(Me Hemper tfr dL (1993) ui ""Praceedhifi of the Bghdt Im^^ 

Chaniitiy**, Ragdale ef aiL. eds^ Amer. Chcnu S«e, Wuhingien, D-C, pp.42-4t (1994)); 



indRi «|iuJs H.aorF, RicqtMlsQandRj tiinopcifcally sttbtficutf^ ether, (hioetficr,4 
■mino or alkyl group, Aliemitively» aad Rj tofcthcr nuiy form a5 or 6 membefedheteRsydic 
ring. Ejufnpleiof imidehertikidetofpArtiGulirrAm 



3C 



(KonmilaX) 
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Vv 



0OOCH(C»Us 



(Formula XI) 



HsCzOOCCH^ 



(jMMiiimAaL Briglttim Cnp Pmeaiem 
Cai^emut-Wttdi.' 35-40 (I9»3}} 




»VC00CH (FonnulaXro 



0 oowoCfHfi (FbnituliXIV) 



0 O-CHCaCM 

(FomiuUXV) 
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hqmO-O^ 0 (FomHiUXVI) 

T»ehert»icidilrfvltyofthe<bo«ainipow^ 
/»/ Brighten Cf«p Pnuetum On^rmtet, Weeds (British Crop PnnecbOB Omol) (Fwnute 
X md XVD. Pnetedingsqfihe t99S Brighton Oi>pPnuaiim Qm^rmct. We«£f (BritiA 
Cmp Pn-caioo Cc»»dl) (Foimul* XD -nd m U A to 
>ttar«tftri^lto We€dSm*SeeUtftfAmtrieA voL 33, ps- 9 (l993XF««wal» XIV). 

(enentfonnula 

COOR (FbnnuUXVlD 

5,1B3,A92, herein incorpoMfid by idcftlKe- 

AJsA of fiignifoiiKft herWcuto hftvin^ the |en«il fonnuli: 




(FbnnuU XVIU; thtMtisimin) 
Weiter #t dt. Brigkum Crop PnH€Ction Cottfemct- 
H'^wii. pp. 29-34(1993)); 
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JS 





HF2 



CHi 



(FonnuU XDC; csfuiUAZOoe) 
TKMKfr. pp. 19^22 (1993)): 



wtetm Ri U Ci^ilkyU opoon^ly aibrtittiied by one or moce hdofcA iwois; 
Rj tt hydrogciw Of » C,.Cr^ke*y, «ch of whtt 11 0^ 
moKhi)oteaatofni.or 

R, tftd Rj i^jether from Ihe powp KCHiVXs wtat X u ha 
Ri if hy^vgcn «r hilofcn* 
RiUhydwgenorCi-CHtfkJfL 

Rj is bydrofwu litre, cytnoorthc group -COOfa or-CONRJ*i, m 
fcUhydrog«i,C,-Cr^UCr^-itt«iyIwCr<^^ 

f intoiuliontl pitcnt puMiCitiOftS WO 94A»99f». WO 93/10 W Md 
U. S. Ptfenc No. 5.40S.S29 i»isned 10 Sdmni^); 



18 





'Si 



S-fbaty[pymeki, aiA m: 




(FonnttteXXb BipyncMen) 
f»9 621 or Pettkifc MullMl^ 9th oL* fay 
Oop Pi^tactiop Council, Stmry (1991)); 

and >Hwbttiiutcd-2-«yM,9,6«7-ieirih>^iiid4it^ (Ly^ €iaL PgffkUe ScL 42:29-36 (1994)). 

10 Uvek of berbidde which nomttnymb^'biio^ 

^licalkm fite» kiiom in ihe an. md whkh depend partly oo cxtemil ficton ludk «s 
Cfl Wrowwent, cimc and method of Kpplicatkwt. ForeJurnpk,ditheeaseof theimkkheriNd^ 
i^reKntcd by Fdnnulae V through IX, and more pamculariy diose rcpitwiicd by Fomuiae X 
thnMigh XVU. (he appVicaiioo nies range from 0*0001 to to 
15 kg/ha. Thi» dosage rate or concentration of herbicide may be difTerent, dependiiig on Ihe de&itcd 
action md particular compound uj«d. and can be detemuned by methods brawn In the art 



I The invemion will be further described by lefemoe to the followmg detailed eumpies. 

I TTtf se exantptes ai« provided for purposes of illustration only, and are not Intended to be Unucing 

j 30 unless otherwise specifted. 

I 
( 

I EXAMPLES 



Standarxf TOombinant DHA and molecutar ctoning techniques used here are well known in 
'the art and aft described by T. Maniatis. E. F. Fritsch and I. ^amhrt^ok. Mol^lmrrtentny A 
^^him^'Ty m^""*^ Cold Spring Haibof Ubonaory, Cold Spring Harbor, HY ((9&2) and by TJ. 




Silh«vy.MJ_BeHMn.«ndL.WEtiquitLe3CpefWwtei^ 
Liboratoiy. Cold Spring Hatter. NY (1984). 



EXAMPLE 1: iMlatkiacrawliMtPnteE-lcDNAhtMdoB 



Totd RNA pt^md from Tritkwn oeftiytpn (cv KAAzlcr) was submitted to Qodtecfa for 
ai«(offleE7NAI(bfiryGORtmiakx)iote Appnmimitdy SO^pfuof 

the cPNA Itbraiy MS piaiccl il a dmQr oTappf^^ 

dapUuftt filter lifts wm made onto iiiiit»eUukjic membdBies (Schleidier and SdmeO). TTie 
io pUqtietifUwmprabGtf with the maochoioK-l cDNA(SEQIDNo5;jMExBn^rie2of 
I Ittiamaiianal ^ipticatioA no. YCtmWXMl fiW June 8. publiated 2i» I9» a$ WO 
I 9S/346S9) UbeU wttfi 32P-4CI? by ^ mdom pii mmg meihod (Life Tochnoioika). 
H/bnilizaiioo and wash coodhkm were at SO^ C •$ de^^ 

Acad. 5cL USA Si: 1991-1995 (1984). Ftositively hybn^Jing fAM^^ics wctc puhTicdand hk Vivo 
IS otctsedintopBluacripiptasnuds. TheiequeneesorthecDNAiBscnswciedeienniA^ 
duii) icfmiiutioA method using dideoxy tenninatan labdad with nuoccscem dyes (Applied 
Biosyiienis. Ine.). The kngest when cDNA obtained lirain initial icitemng effocu* riesignamd 
'wheatPmox«'l\ was 1489 bp hi length. Wheat Procoa- 1 lacks coding lequenee for the transit 
pqitide plus approximately 126 amino adds of the mature coding sequence based on comparisdn 
ao with the other known plamprocoi peptide sequenoes {Table iX This paitial wheat protein 
teqoenoe is 90% identical (94% simiUr) to the maoe Ptotojc-} precetn. 

A second screen WIS perfomied to obtain a longer wheat protoxcDNA, For this scteen a 
Tntictm a£stivwn (cv fCanzler) cDNA library was prepared internally using the lambda Uni*Zap 
vector. ApproKimateiy 200.000 pTu of the cDNA Itbtvy was screened as indicated above, except 
thai the wheal pmox*! cDNA wis used as m probe and hybridization and wash eonditsons were 
al 69^ C instead of 50OC The longest wheal cDNA obtained from this screening effort, 
designated 'wheat Piocos* la\ wu 18 1 Ibp in length. The nucleotide sequence of this cDNA and 
^the amino acid lequcnce it encodes is let fonh in SEQ. ID. Nos. 9 and 10, respectively. Based on 
compariton with the other known pbnt proto« peptide sequences and wiih corresponding 
genomic sequence* this cDNA is either fulMength or missing only a few trensit peptide codons^ 
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fhis wheat pracdn aeqinoc is 91 % ideiitktl (9S% «M 
• •equenoe M focth in SEQ ID Ko. 6. 



liobtiod «r a loxbttD Pmtai-l cDNA 



EXAMPLE 2: 



A UnbdA Uni-Zvp ^NA libnty pn^ttnid from soybean ( v WsUiams 82« epicocvb) wis 
pnchaaedfitMnStntagene. Apprnximatidy 30AX>pfuoflheUbrafy w»pUi(4a^ 
appTMimately SJXWpfii per lOcm tari £sh apdilaplieatt filter lifts uMie nudaoMo 
CotoftyyPtiqpe Scicen roet p bnnea (KEN DupomX The plM|ue lifts were piiaMwitb the 
AiMdop^PtMX'X cDNA (SEQZDNo. 1; JMExmpfe 1 oflmmatiooal^ictfianTio. 
PCT/IB9MX>4^Z niad June 8« 199S, pubitsM Dec. 21,1995 as WO 95/34659) labck4 with 32P- 
<iCnP fay the random primmgmediod (Life Te^^ HybridtziciafictiiMGtiMs wt7% 

iddiimidod«eyl$utfaic(SDS),OJMKd>04pK7.0afBM WashcondiikM 
were2XSSCJ%SE^atS0C toitivtlyhybfidizing^a^vmpuriM 
imo pBlucsoipt plwmds. ThescquenocoftbedMAinietts wasddenniiiedbyte 
lemuiuum method using didaay lenniiutais labeled with fiuotcccent tfyct (Applied Bioaysiam, 
Inc.). The loil|e«t soybean cDN A obtained, desifnaicd ^soybsaA Pracox-l '» is fuR-lcngth baaed 
on eompariiM with the other known plamprMM peptide sequel^ Soybean Piotax- 

I is 1847 bp in length and encodes a protein of 58.8 kD. The KHeminal peptide setpietnae has 
features chancteristk of a chloraplaMiraAshp^Akk of approxii^ The 
soybean pioteia is 78% identicaJ (87% similtf) to the Aribidopsis Protox'^l protein. 

Soybean Protox* I , tn the pBhiescript SK vector, was deposited Ooccmbcr 14, 199S as 
pWOC*12(NRRL«B-2l5l6). 



4 



i 



An aiigncneni of the predicted amiho acid sequences of the respective proteins encoded by 
the sequences shown in SEQ !D NOS: 2 and 6 are set fonh in Table I. An alignment of the - 
predicted amino icid seqyefices of the itspeccive proteins encoded by the sequoices shown in 
SEQ ID NOS: 4 «nd 8 are set rofth in Table 1 
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TABLEl 

ArtHMoprii rPtvte-l*'; S£Q ID No. 2), Mabt rir.ipnrtMi'^r; SEQ D Ffo. CW 
Wbc^ rWlMtpU**; SEQ ID NO* 10) ««l SojbM (**SoybMi^-*; SEQ ID Na U) 

kk&tkAifcsiduei are denoted by teven^ AlifmMa 
peitomd ttting the GAP pfognm described in Devoiiu «f aL, HmcUicAeidsMts. !2Mt'i^ 
(IW4). Poshjwwbidk may k» modified Moidiiiff tote 
nliibiiiv miiianec m shoMii in 

Hxprttoic-l ♦ • * 

whMtptl * 

S«yfaMnpt MVSt^FMSlLP PPHQfTLUlM IMSVrSrm FTflXmMf IfTlLWrSM 

»T«to«*i KlULLRpr TQSLLrsm muurvticf Lii.Lftesm0 

NxpcOtoK-l W AD CVWOOCXSG LCTAOAlATR MOW..OVLV 

NKMCptl ♦ • • . . - 

g«ybMA9t CSTA^PPrnt OSAP VD CVWQOOVSfi LCIAO&CATK KAMA. .NWV 

PfoccMc-1 crrvossKZE o u u smm cvrvoseiss tczAOAunc MraMPNLXv 

101 »o 
Mtprotwi-t TMAKmi trrmvtm mMMMovw owpvltha vmbubolv 
wh«AcpcX » * 

SoytMiipt t^AADRVOGM XTMSt. .06 YLHCttfllSP OtSSPKUfW VOSSUMbV 
' rroto^'l TtAXDRVOCat ZIT..WMG PLHSCfifMSP OVftOMLW VDSGWDLV 

HxprotOX-l rCDrMAPfW LMBGKLUPV? SOAObPmi UCSZ>CltL» OAALfilRPP 

»ftl-Atptl 

AoybMnpt LSOPtUPHfV CJWnUO^PVP 6KLTPLPPP0 UlStOCUlU OfCA&CXHPr 

rrdCOK*l UCOPTAWV UOCKLRPVP aW,TW»ffO UCSIOCKZAA OVOACtfimM 

aox 

Nvproto>-i ^PQur^vce rvMUfLGACv r0iLiErrcs craaDFsn. skkaatckw 
HhMtpti PFSifcccsvvE rvMNifiACV PBu^zcpfcs GVYwrsn SKKAAracw 

S9yL*«r\pc Pf^HUSVEC rVKAHlCDCV fMIEfPCS StdCAAfCfCVIf 
ProtOM-1 PPCRECSVEfi PVMMJGDCV PCAUFPfCS CVYMD^ICL 5nKAArCK\M 

Mzproto^-1 AUUTQCSIZ Oc^XKTZQeA SKMPKPPlUA RtPKPKOQtV ASPRXCUKL 

, ^ACptl RLtEXGCSXt OCtZXAlQClkt C1CN?t(^PIU)P IU.PAPKCOTV A5PM0LANL 

i*.4eyti«4npt KI^CJtMOCSIt OOTPKAIOER NCASRPPXDP RLPXFROQTV QSrWCttML 

Proto«-l ^gWQCSIl OC»rKAIQE|L KNAPKAEIIDP KLMPgCOTV GSmCLAKL 

30 X WO 
HiprOCftiC'X PNAtTSSLCS KVKUmitLTS XTKSmUTW LVrcrPSQW SVOAXmMT 
Wh4«tptL niAlASIUJGS rVKLSWICLTS XTKAOifOGYV LeYtTPfiGLV 8V0A»^XIfr 
SAyte«npc POJ^XSAIOCN KVIltSWKLfS ISKUMSCEYS LTtfeTPBCW SLQCKTWLT 
rr«tO«-l PtAZSAlUXXf KVKLSWKiA: 1TKLC5GCTM LTYTtPDCtV SVQSltSWirr 
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m 



ID 



IS 



2D 



2) 



Frotox-1 



MhMCptl 



TABLEl 

351 400 
IPTTVASKXI. ftPLSSDMBA LSHTtYPPVA AVTVSm» lUEtLIDCE 
TPSYVJ^Xt RPI.SIIMM LSRFmm AVTVSmEA mCSCX.2XlGB 

XFfimsTLL msMuUDK Lsxrmpvj^ AvsisYpm vseciivsb, 

VPSRVASdiL RPLSBSAAm USKLTmVX AVSZSYFKfA XICTECLXPfiE 
401 450 

LQOT^LHPti sgcvmcTX YsssLmou F»vLUJir xogmhtciv 

LQSrOQXilFR SOCVSTLCn YSfiSLfPHRA PMKvUUnr JOGSHmiV 
UKTPQOUIPR SgcVSnorX TSmrFHRA PFQRVUIJV ZOGXnfrGXt* 
LKCragLRFR TQOVmCTl TStiSLTPfflU nttRXUAUr lOGSTWCClL 



4S1 500 
SmSELVU VDRDLRICKI.X MSTAVDPLVL GVfCVIffOASF CFLVmTJLT. 
SKTKSOLVGA VDRDUUQCLl KPRMDPtAL GVKVWPQAZP iJFtiZGKLDAL 
SICn)SKZ.VET VDROLRKZU MPNAODPFW GVRIMPQMC? giXVCHUDLL 
SKSSGELVXA VDRDUUCMLZ KmSTDPLlO* CVJCVMPQiaP QFX.VCifFDZI. 

501 550 
kAAKAAXMO 0TQCSLFL09V WAGVaXCRC VBOAYSSASQ ISSMTtKl 
AXAXSALCQG OtDGLTLQCM YVWSVALQIC ZCSAYESXSQ VSOTLTKYAr 
DVAXASIRKT GfQQLFUXal YVSQ^^UiGRC VEGAT5VAAE WDTttMRVY 
DTAKSSl-TSS CTGGtFLOOf YVACVALGHC VEGAYCTAZE VWM^YAY 



30 



35 



551 

Ktpr»cox-i K* 

WtMACptl X« 
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ComnriMB flf AraMdofiris (SEQ m No. 4) I 
Maitt ^EQ m NO. 8) AfldM Add r 




IS 



20 



PdCcttt Similarity: 75*^ Ftteem laemity: 57.9QS 
PR>cox-2.Pep X Mzpmox-lPep 



1 WLSaKVAt> , KOUAVtiOCRVJkV 21 

32 VOUSVSGZJMYICLICSRaJIVTVPBATO "Vl^ 

lllllllllllhl: -l:|||ll|ht|.llhl- = 
SI VOMTVSmUUOKUOSSVllVTVrEM^ 100 

|.:.|:||ll|.:(Ihl I M Ml I = 0 ■ I • ^ 1 1 . : | M u 
101 HTGCCUEA5IU.20DL6L0OK9QVPI^^ ISO 




3S 



30 



35 



40 



4S 



50- 



122 SSVLSTQSKFOILLBPFLMXK ICS8KV9niW£SVSSFr0W£POQE 

II||IM|:..--Ul)hll «fUI|:- -lllt-l 



l€7 
300 



217 



16B vvnY;«zDPFvoGTSJua)FPsx.SMiai5rpotJimvncsf^ 

|||h:|||| = lIlhl|:l|l::MMIM:.H|tmil Jjl „^ 

201 vvpyrvDprvACTSAcppe^x^iuu^AxicNznicy^^ 2S0 

218 AJqSCKSlUgKSSPGTta ca SB Q SFSri^^ )S7 

251 AXCDPVlCtlUfflSfiGKRIWlUlVSPSF^^ 3O0 



3«6 
301 
313 
351 
363 
401 
413 
451 
462 



501 



\a5tS..n7dGSR0tNWSLSCVSfA'&TQRQ.«.KPfrrD^VZ^ 312 

nil. =;:.. :.|| = |. |. h 011111111 = 11= 

EKICVhlCGC»PP^U'rL«im?t5VLmm«^^ 342 

II. MM. |:l|l.;:|i||hMM"ri-IIMIIIIlllll I 
e.0KH6mu7aP5swfmR5Pso^naYTtrioc5w 

I tll|:|lll(lllinill|.|.t M. I 

KOVVTSDLCKIMVBSEPVSVNKTVMUgim 4«1 
IMIIII.MIIIIht. |.| II ,11111: -l.thllhlll.: ^ 
KQLVTSDXJOuIiGVDGOPTrVIUlVytaCN^ 500 

LKPrTACNHRCXn.SVClCSIASCCKAADLVlSYU&SC5IIDKXPm 509 

lllllltll ::|M|. Illhlllll.llllll : 

IfK^PrtAGNSWMU^VGSVlXS QSI U MM StX^HrtiQiHSH* . . . 545. 



'03fO?|NS(l)MI|01^ li;B S K B 



P. 2! 





EXAMFLC3; 

Uqiiid cttlwm of PtoioxOySASX3B, Proiox-2/SASX3t «nd pBiueacripc/XLl-Bhie vat 
frwDDiLttnplW* OQelHinditdiiuai^iieraliqvict»(rfe«dia;&^^ 
nm&A cofitttfiing vitious ooncciitrttioiis (KCdM*]OiiiM) of % pMm iAhibttovy ifylmci] hoMcidc 
offonnultXVIL ZXipliCMiettof plitea wtreiftfubited'for IBI^^ 
comptcte darianest. 

ooncoimtkm, cooiistetf whh ftpor 4re$isttittof theiutivelD«ctaialaa]mietoi^^ 
hEfbickfes. 11)efroCf)x-l/SASX38w«scletf)yseittitive, wTibte 

6bmiiiiledby inhlbkoreonoa^^ ThenDCOX-2>SASX38 wtsntooiEntUive. 

bin(mly«ahi|herooncc«uiit]OA(IOMM)oftbeha^^ TbedTMaftebeibkidieonboChpltnt 

praiox itniiu was {Boft dnmitk nt loM^ 

thcdvfc. The Uk&kiqr of the hcfbidde WAS entirely diniix^ 

(othepljte*. 

The dirretCAt hertMctdetoleniKe between the twoplmt PMox ttiins Ulikely ccwh of 
^mntuJ expiession from these two pltsmidft^ndier thin Any to^^ 
icnsitivity. Protox*l/SASX38 pow& much more slowly tti«n ftoiox-l/SASXM m vq^ beme- 
defieient media. In addiiiocw Che MzProiox*2/SASX3S stnin, with i gfowA rue coo^Mble to Anb 
PM0X-I/SASX3S. is also very sensitive to herbictde At the lower (10-IOOnM) oonceomikms. 
tnutxl chafacterizAtiOfi of the yeast ?rotox-3 clone mdieued dut it also is herbicide senstti 

EXAMPLE 4; Seleeting for phmt pntox tenes rabUAt to prvtox^ahlbilory herWddes In 
file E. eoU cKprasloa «y«teia 

An AjibtdopsisthAJUnA(LaAdfiberg)cDNAlibniy inthep^ (Mioeter 

ol.. FUihi I 2:4I7«422 (I9») was obcained and ampHfied Hie E coli hemG mount SASX38 

(Sasamun tt aL, / Gcfi. Microbiol //i:297{1979}) was obtained and maintained on L modia 

comainmg 2(Xig/mI bematin (United States Btehemtcali). The i^asmi^ libnry was tr aio famcd 

into SASX38 by elec tro poroion using the Sio-Rad Gene Pulscr and the manufacturef's 

eondttionft. TheelectioporviedeeiU wempUiodoo LacareoACainiof 10l)ugM nmp 

density of appnximaiely$0CUXX>trinsfonnantt/10emplaie^ The cells wem then ineubated at 37^ 
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C for 40 hcxn in k)w Bgto selected for the abUity to 

heme Heme pfccotnphs %tre ncovcred il a fn^otKicy ct4C0f\lP fnotn die pFLSl libniy* 

tesgBated *Tro(ox-l / the praiox frne cxpe^ 

The pFUl Ittnof is a expmnon fib^ 
bidirectiaiiaily. ThetecDNAffatfabobeexprtSMdinbacterii. Hie protoi dm As tppMnUy 
imtiiteitaniiHfiiiieATOmcheyeittJ^ 10iiitboKals5*fiotbc 
NoU clonrng Mte in tbe i««dtDr «im1 «e espnssed tm^ 
fuither upstream. BoCitiie PrxoxM cDNAs thai included sifnifte^ 
10 inAsitiequcncciihSNKalthepin^of ^ 
amoum of ehloroplett tnnsk tequeaoe ttad^ 

o^enmotts. This done, pSLVi9.eonulitt only 17»nfiioacidsof ibe putative chtoraplast 
tFB&sit peptide, wiih the DNAiequenoebecinniogatbp 15) of d»ArabidopsuProcox-l 
(SEQIDKO, 1). 

I IS ThepUitmdp5LVI9 wistnmtoinediiUDthenndofninui^^ 

(Stnugene.UJotkCA). TbeCransfonnaciofl waspIaxedanLfncdUemta^ 
amptdllin and incitbatcd fa 48 ham at 370C Uwns of traasforrnedoelb «m screed fiom Che 
pUiC$ and pUsmid DNA prepared ttsing dte Wizaid Megiprep kU (PiDn^ep, MadUoo, WI), 
Plasmid DNA ttolaied fiom this mutator strain is predicied to contain approximately one Fandom 
30 base change per 3000 nucleotides (see Giecner et oL, SiFuiegies 7r2;.'32-34 (1994). 

T>ieiTUiiatcdpl«*mkl DNA wuixinsronned into die bem^ et 
at. 1 Cen, MicrobioL I iI'J29l (1979) and placed on L media concaininf various concemrations 
of protox-lnbibiiinx hefbidde. The plates were incubated for 2 days at 37^. Plwmd DNA was 
tsolaued from «]t colonies thsi grew m the presence of heitticide ooncentmions that efTeetively 
» kaicd the wild type Ofain. The isolated DNA was dien iraAsfotmed into SA5X3S and plaited 
again on hertekkioensuiethit the leditanoeobwvedwMplasmid^^ The protox coding 
sequenoe from platmidi pissiag this seraen was ekfiised by KotI digestu^ 
- unmutagenized vector, and leatedigain for tteabiUiy to ctmferhetbi^ 
(cqucnce of protoi cDNAs that confetml heiblcide resiitancc was then detemined and mutations 
identiried by comparison wich the wild type Arabidopats Ptoiox- 1 seqiMce (5£Q ID NO. 1 ). 
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A tiftgfe coding tttfioiot muttnt wu iccovocd from the flnt i 
ThxtmuiimkadsfDenlittttedbetbickk^fesi^^ bcsoauanit 
C to A mutation ttmiefeotkle 197 uSEQIDNO. I m the tnincated chhmpUst tiaiiit seq^ 
of pSLV19« ccttveiting m ACG codon for TTutOQiM 

56ofSEQIDN0.2,«Mi«smtiiiginbeBErcofnpleiiie^ This 
plumkl also etiAUiiisssilem coding *6qu^^ tQ59»wids AGTCScr) 

cbaofiBf tDAGC(Ser). Ttoplaiinidwt>dMigiuitrd^to-l. 

TliepMut-l ptasmid was then transToniiBd into tfae 
■hove and Che mittaiBd DN A was isolated and plated OA an be^^ 
10 die vnmutagenized pMitt*l protox gene. HerWckk tolemi colonies w«i€ isoiaied a/ier two days 
at 37 C and analyzed as described ahove. Midiiplc piasmids «m shown to oootain berbieide 
resistant pmox coding lequences. Sequence analysis indkitBd Chat die lesisiamgcaesfcAlBio 
ihcetclaucs. Oneiwsianeemutadanidentiried wasaCtoTchingtntniid^^ 
AxihidopsisPRKon-l sequence set forth in SEQ ID NO. I. Tbu change oonvots* OCT codon 
IS for alanine at amino acid 220 of SEQ ID NO. 2 to a GTT oodon for valine. Bid was designed 
pAnC*lVal. 

A second class of bcitncide itsistani nuitant contains an A toG change at nubleodde 1307 
intfie ArahidopsisPraojL-l sequence. Ihischani^conveitt a TACcodcm for tyrosine at aoibio 
acid 426 lo a TGC cedon for cywinCt and was designated pAnC*2Cys* 
20 A ihlrtf rtsistammuunthasaOto A change at nucleotide 69! in the Afibidopsts Pmox* 

i sequence. This nutation converts a GGTcodon for glycine at amino vid 221 loan AGT 
codon for serine «i the codon position adjacent to the motalioo in pAnC-l. This plasmkl was 
designaced pAraCOScf. 

Rtsistani mutant pAraC-2Cys, in the pMut-1 plasmid, was deposited on November 14. 
1994 under the designation pWDC*7 with ihe Agricultural Research Culotfc Collection and given 
the depostt designation NRRL «2I339N. 
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EXAMPLES: 



S The iniitio acids kkntifvadtsheriikidei^^ 

by od» ttnuiK) adds iind tsstoi for fii^^ 
OUgoiiudeadMiraeted imiusenem of Ihe Ar^^ 

tbe TaiufwiWftf Site'DSieclcd MuMgcpcM* Kh (Qopiccli. Paio Ato, CA). AftBraaiiDoadd 
cbMigcs m eoBfiniKd by leQume miys^ 
10 indplaietfimL-«mp](M>iiiDdMtottttforfinic^ 
iflhftittiic hcvMdik to M fiir CDknn^ 

Thii praoedore U applied to die aUnino oodon It oudM^ 
SEQ ID Na 2) iid 10 Che tyiotifie codOA ei ituckx^ 

No. 2) of ihe AfibtdopMs Prpco<-l ifqucncc (SEQ ID N0> 1). TbefesiiltsdeiiioosMo that the 
is abMnegadCTiaiBuitoxidct68&-690canbedianyedtoacodOft 

cyttineorisolcucineioyieldanbctf^ickk-res^^ The 
resulCB further demottfinte that the tyras^ 130^1308 can be chaopd to i 

oodon for cysteine* isoleuctiw, leucine, threonine or methiomne lo yield ait herWcide-iesistim 
pracox ettzyme whidt retaiu fuAetion. 



EXAMPLES; 



laobttoo ofaddlttonai staUCioof thftt 

» of pmioysiy Uestlfkd raistaiU 



25 Ptasmsds eomaining heAicide itaiMim proton g^oes are cransfonncd ioio the iranatflr 

KiitinXLl-RcdaiKiiTwuiedDNAisisoliiedasdtacribedabo^ Tlic mutated plasmids are 
tfinsformed imo SASX38 utd the tniufomwiU ift scieened on hefhicide conccntmions 

^ iu/Ticient lo inhibit ^iDwth of (he orifinal 'lesistant** nwiaoL Toknnt coioaies are isotaied and 
the higher tolennce phenocype U verified la bdng coding lequetioedepefldeitt as described above. 

30 The lequenoe of these Duums Is detennizied and mutatioAi identified by oompariscxi lo the 
profcftitor sequenoe. 

This procedure was applied to the pAiaC- 1 Valmutint described above. Tberesults 
demonstriie that the senile codon at imiiio acMl mimber 305 (SEQ ID NO. 2) e^ 
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IS 



70 



25 



oodon fer kwifie to yield ai enzyme widi higher tokr^^ 

pA^^lVftl mount done This second ihcnnution is totgnMcdAnCJC^^ Theskne 
fesota n demdiistriked for the thre^ 

uokaKlfieortDitimnele«dstt>tiDontolenKeii^^ Ttaeehsogessedesigimed 
An£349Ile sod AnC249AlAt ftspectively. 

The procedure wis ilso spplied to thepAnC*2Cyi mutifit described shove. The itsutts 
denumstnie chst dbe proline codoci St amiiK) e^ 
for leudne to yidd sn enzyme with higher Uriei^^ 

pAraC<^2Cysniutinte)ooe. ThismitttfidnisdesipiiiBdAnCllSUa. Ite sim icsalts are 
demMstrsbPd for the seriM eedoQ at emino ici^ 

tDkrsmpAfaC*2Cyaeii^ii>e. This chsnfe wis also isi^Aadwidi the pAnC*lVaIinitti^ 
described above Slid is designated AnC305Le(i. Additioral mutations thsicnhanee the heibiride 
resistince of die p Ar«C-2Cys mutaiu include an asparagine to serine ebai^ at amitio acid 425, 
designaied AraC42SSer* and a tyiosine to cysteine at amino acid 49S, desrtftssed AnC49SCys. 

Tltese changes arc referred to as 'second Kite" matations became d>ey were found not 
stifTicieot to cortfef hetbidde tolennee alone, but laiher enhsnoe the Ainciton andAv the hei b ici d e 
tokiance of an already mutant enzyme. This does not pcechtde the poislbiHry that other amino 
acid substitutions at these sites could suffice to pioduce a heri>icide tolerant enzyme siooe 
exhaustive testing of aJI possi We replacements has not been performed. 

EXAMPLE 7; Ideatificatleii of MkliUoanI aitca In the malie Protas-l fene that eu 
be mutated to give bcrhkkk toiermncc 

The pMut-1 Anbidopsti Protos »t plasmid described sbove is very effective when used in 
muu^enesii/screening expeiiments in thai it gives a high frequency of genuine coding sequence 
nuiianis, as opposed to the frequent up-piomoter mutants rhai are isolated when other plasmids 
areuied. In an effoctlocteaie an efficient plasmid screening system for the inaizelY^ 
eDNA. the maize cDNA wasenginecftd into the pMut-l vector in appnuimately the same 
sequence eoniext as the Anbtdopsis cDN A. Using standard mediods pf overlapping PGR fusioa^ 
the f end of the pMui* I Anhidopsis clone (tncludint 1 7 amino acids of chlongplast transit 
peptide with one missense mutation as described above) was fused to the nuUe Protoi.-! cDNA 
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tequnceoininiminunawdinmtber I6(SEQlDN0.6)rf TbeJoidof 
(he maize cDNA wis mimgod, NoU redrictioo sites were pieced on boib ends cf Ihii fBsiOo» 
indtedumerictetK was ckned into the pPL6l pUmidbacUmeftompMat-i. Sequenee 
anatyw revealed sin^ oudeakk PCR-*deri 

eiiiuctootides752-7M(S£Qn>NO.S)teaiiACrcodofuboihcf whi^ This 
dumeric Arsb-macee Piocok- 1 pUsmid is dft'ignamd pMul-3. 

The pMiit-3 plasnud wis iiiiisfieiiRied iAfO the motecor XL I •Red satin as described above 
and the mutiiBd DN A WIS isobicd and plated cm a heriiid^ 
uoiiniti|eni2edpMui-3 maize pcotoxscne. HeMeide cotam oolonks we<e isolated after two 
10 days at 37*C and analysed as desertbed above. Tliis analysis leveetednwldpleftonikb 

cemaiAiqtheibiddeiesisentpRMo«CDdii^segucD^ Sequence analysis shOMd 5 single bve 
cbangesihiiindtvkiiiallyiesuliiflaheri)ieidetok^ enzytne. Tbraeofdwse 

miaatiORS con esp ond to amino aeid changes pftmously shown to confer tolefanoe at 
CO f itispo n d ingpciwtkw wtheAnab^qptf^ fene. TwAof thedMeMpMzC-lValand 

15 pMzC'llVt convening dwa!atdne(GCr)«t amino eeid }66(SEQ]DN0.6)tt>dtbervainie 
(GAT) or to threonine (ACT). Tbis position oortesponds to the pAf«C-1 mutations described 
above the third analogous change convem the glycine <GCT) el amino aeid 167 to Serine 
(ACT), corresponding to the AnC'3Set mutation described above. These results serve to validate 
Che citpectation dwt herbicide-iolenm micaiions identified in one plant piatox gene will also 
3 0 Gocirer herbictde tolc«snoe in an equivalent plant protox fene from another spocies. 

Two of the miitiiionsisolsfed from (he maize Protox-I screen result in amino acid 
changes SI residties not previoiitfyidentsried as hefbicideresistaneesittf^ One change converts 
the c7Steinc (TGC) lo phenyUJsmne (TTC) at amino acid 1 61 of the maize Piotos* I soquenoe 
(SEQ D NO. 6). The second converu the isolcucine (ATA) to thrtoaine (ACA) at amino add 

25 421. 



EXAMPLE t: 



CeobblfigMasUIMi 

1 10 CTtttc UgMy itocttoonl/btgMy I 



\ AraClCtfLett mutation described above was found lo enhance die fiindion/hefbidde 
; ofboth the ArtC-tVal and the AfnC*2Cys mutant plasmids. In anefToittotestdie 
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20 



gmnl uefiilness of ttus ieoond uie moudm, it w«f conb u iod acpmely wvtb tbe Ai«C*2Lea, 
AraC-2ViLml AnC-2Iteimiutionsandteiicdferlie^^ XiieKhctte,the 
AraC305Uu change signtfk»ily Ricretted tlie giD^ 
pcQun-inhibitinghertricide. Qnnbinatkwafche Ai«C-2Ileietiftamimit^ 
leeoiMi iitc muum AnC249Ik or Af«Cl 181^ tls6 pro^^ 
enzymes. The AJiCS49DciWitiondejiia ft tta i esthit>>eo^ 

eohcnciiis m AnC-l mutant m«y also iDcicaie icamiefi of an AFaC-2 imitBiC A ifam 
mutaiiaQ plasmki oontahiinf AnC^2ite. AriC^ 
pralto a higWy ftiictkidal lugM^ 

pnttoi* lnhlhWn t c mupwmJa 

Rcsisiani mutknt plasmi4s originally identified based cn mi^^ 
ifdubttoryhecbtcide, weie tested agiiitft a ^ec^^ For 
this tett, the SASX38 ttrain oontaiAing the wild-fype plasnud is plaied on a range of 
eoiioauratioos of each covnpound Id determine the )et^ Resistant 
mutant ptasmids in SASX38 ar& plated and scored for the ability to survive en aooncenirationof 
eadi compound which is at least 1 0 fold higher than the conc cti uaiion thai is fetfaal to chc 
S ASX38 stiaiA containing the wild-type ptasmid. 

Results from ciw-toleranoe testing, iilustnied in Tabks 3 A and 3B below, show that 
each of the mutations idemified confer lolerance to a variety of protox inhibiting oompouoda. 
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3A 



ioVi 



AM&lVii AMC-2Cri AiflC^rrir AaC-mr 



+ 



OGA276-SSi-Cii» 
a3A2487S7-Kimni 
OOA 175*769 -Itobiii-Hni 
OGA 2fi3' 195 * Sumiidiuu 
OGA2M'S93-UBiioyil 
OQA 260670 - Sami&emo 
CGA 333*K35 « Niboo-Mohyifai 
OQA 2iS*CKZ7 - SunilDaio 
«OaA 302*640 -FMC 
"OGA 335' 141 -FMC 



^ m lOX or nofC lokmt tlutt WT 

10 lOOX or more tolomm than WT 

> « no cieu lolcnnev 

• V ihei£ cofTtpoiiiMU wot tested but provided no infonnuion 



■M- 
♦ 
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EXAMPLE ]«; 



Beciuie the dcfCTibed mutoa coding fopienoes 
expmaed tmder the coQtnl of ihc native pitu^ 

teqoeaoe in its rati ve chromoiofful locixion repitaent si iJtereidive men f dr ywr a i ii g 
tnbicidctoknanptentiMdplvit ceils. Afrftfrncntorp«QcnONAcomimiii|tfhedeaM 



IS 



30 



3b 



(Ml Uckinf itt own cxpmm ii|nils (diherpfwiKKcrorS* i 
nttraAioad bj^ttt)^ ^ act^cnl nt^iecogRized msthocto (for ii rt ia n c t, ^ 
tnusfonnitkm^ diiMjem tnnsfer toprc^^ mknprojocttk bomlMRlinaa), *ikd 1 
tokmtt tnnxf oniwm* tdectoL TheinttodiccdlMAfngniemil&oeott^ 
restriction cnzynv siie or othcf soqucoot potymofphisn thit it dttradoocd by i 
nuttgcAcsU in MTTO without chnging 
As his been pttvkwsly icponed for various se l ect ibte tn^^ 

Piukowskjefai:,£X0O/ 71- 402 1 -4026 < 19881: LcetfC aL, ^ZmC^//^/ 415425 (1 990); 
^iMoaiwriaL^PiaftiJ, 7r 109-1 J9( 1995)). some innsforminism found id result fnn 
homologous intfigntion of the mutaiu DNA into the pnitox dvomosocnil loeui. or frem 
convenion of the native pnaiox chramoiomal sequence to the ianodbiced mittani toqiaeace. 
These nruformsnu are recogmxod by Um eofubioatioA of thetr herbicide-toleram phenoiyfte, and 
ihe prescDce of ihc diagnostic renriciion enzyme lite in their protOK chroinosoinal ioeua. 



(■ 



)0 



EXAMPLE 111 CoctttnicUott of PtontTrmBSforiMtioo Vector! 

Numerous trfinifonnaLion vecton are aveilible for plant transfdinution, and the genes of 
this iAvtnikm can be used in conjuiKtion with any such voctors. the selection of vector for use 

\ win depend upoo the pfefcmd tnnsformauon technique and the twget species for tnnsformaiiofi* 

i 

For ocrtiin target species, difTerent antibiotic or herbicide seteciion matters may be tJ i tfen e ti - 
Selection maitert used routinely in tnmsfonnation inclu^ (he ttpiil gem whkh oonfcrs leaistance 
to iLJAiunyctn and reitfod ancibiocics (Messing * Vkm^ /9: 259»2M (1982); Bevan fint, 
AWi«r«J^;{84^l87(i9f3>kite&srgcfieMhichcin^^ 
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DNA borders for A^iutecfcnom-mediud tmufonxiition, the RKZ-detfivecl af^ fiiMtkM for 

fDobilizitkm between £ cdli 

The pCS2001 polylfiikBr U fuhibk for the doom 

own legulttoiy npiab. 

ConsmKiigQ of pCIB Ifl md Hygreaycai Sricaiqn DcriYtf vaftawf 

The biAay vector pOB 10 oofiuiitt t fene eoe^ 
pUnts« T-DH A riflhi and kft border mpicnces and inootpontei aequ c m x * (nxn wiie hon^ 
iince|dBsmjdpRK252a2kmAgitiOfeplkmt»boch Its 
GOnsmicDon is described by ItoOtticine^ 153- i$l (tWT). Various deriv«ives of 

pCIB 10 have been oonstnicted which incofponte ihe sene 

described by Gria ef oJL, Ctfiitf 25. 179-188 (1^3)). Tbeic derivadves enabic aelectioo of 
msgcnk pUni cells oa hy^rnnycin on^ (pCIB743), or hypmydo and kaaaniytin (pClB715« 
pClB7n), 
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(2) CcmstnictiooofVectonSiiittl^forn^^ 

Tnntfttnwtioo witfaoui the lam Agrob0tunum hmiiifadentdiRmBNccn die 
nquiicmem f or T*DNA i^uenca in fhc cbotCD 
1ad±if tbe>6 leqiKiiocft a» be Dtilizdl 
whkfaoQnuinT-DNAieqiiera*, Tnniftnittion ledmiqo^ 

dcdraponfion) ind nuciouiiQction. Ttecboioc of vmot depends tvgclyaBibtpicfeiiul 
•eloctranfcrtbefpembekistniisfoffi^ Bek>w»ihecQn«tniotioii of mne typical iPocM 
d oftifa ciL 



pCIB1064 a A pUC-derived i«ecor suitibie for diiect fcne ttinsfcr tabniqiKC ia 
cocabuiAtioa widifdectkmbythehobid^ TbepUsnbdpaB246 
oorr^riifis tbe CaKfV 33S pforr^xer ift opmtk^ 

15 35S trsiiOripiiCMial tenniimcr and is dcscnboi is dw PCT published ^application WO 93^)7278. 
The 35S promoter of diiftvccw cotkUbks two ATGj^ Th^siies 
woe muuted using tttndanl PCR tedmiquti in tucfa « way as to remove the ATQsttdgetMtiie 
d» restriction tiles and ^wiJ7, Tlw AewfMnctioniitesweie96and27bp«wayfm 
uAique SoZr siie and lOl and 42 bp away fromihe acmal start titt. The resuliant derivative of 

ao PCIB246 was designated paB3025. The CUS gene was then eicised from pCIBlQZS by 
digestion with and 5ac/, the termini itndered blunt and religated v> generate pUsmid 
^ (CZB3060. The pUsmidpjrr82 was obtaii^ from die J^Inncs Centre, N 

bp5ma/ fragment eontatninf the Aor gene frocn Streptomyces vindachnmiogew was eactsed 
and inaened into die Hpal site of pCIB3060 (Thompaon et 0L EMBO J 6: 2519-2523 (19S7)). 

35 T)iisger)enucdpCm3064wbkiic(impriaes the he^ gene iin^ 
proratster and tefininator for herbicide sekctiin. a geie Cro am^ 
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nitabk fdr tf>e ckmiAg of plstf cxpr^^ 

CgnmiciCT of pSQGl? md gS0G3g 
S pSOOaStsBtntfufomM^ 

(DKFR)ttaMlecXil>knitt«EercQafen^ PGR wiiaiedlo«S9<ify 

die 355 pramocer (-£00 bp), intron 6 

vatmsUud leader «Bqutiicefh^ A2S0bpiT«cnmcModtngte£ 
dihydrofolate iniucUBe K)i>e n gene wv lOio ai^ 

were awembkd wiA %5oiJ-Psd fiafneitt ftofn pBmi (Ooniech) whidi oomphifid ttie pUC19 
vector b^kbone and tfkenop«Jinesynll^ Assembly of these fngmems Benemed 

pSOG 19 whk± conUiAs tbe 35S praiittUir 

ilwDHniKeneandthenopaliiw^lhiseicmtt^^ RcpliMnemaf teGUSkaderin 
pS0Ol9wiihthclcad«'iequciK«ftomMiizcCbloi^ Mottle Vims (MCMV) gcncrtttd the 
15 veciorpSOG35. pSOG19andpSOG35ciny ttepUC|ci»rof mpwUltnie^^ 
mndllh SpU, Psd and EccHt sites available for the doning of foreign iaqiicooc*. 

EXAMPLE12: Cotwtniction of Plittt Eiptt»rion CtsatUti 

Gene sequeiKes mnendM for eJtpitision in mn^gcAie plants ate fuatly assembled m 
20 expression cauettes behind a suitjible pcomoier and upstream of a suitable txinscripdon 

tertniniior. Thcie expression cassettes on then be easily ti«nsfcn«d to the plant transformalion 
vectors de^bed above in Eumple 19. 

2% Thetekctkmofapranncerutedinexpnaskmcatfes^ 
^ iemport]exprtiuionpatu»noftetnntgeneintheiri^ 

express tvwgenes in specific cell types (ittJi as leaf epidennal edls. mesophyOoeils, iwtconex 
cells) or in qwcifictisiucs Of Off ana (roots. leaicsorHowca foreuni^lJliid Aateleetkm will 
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feOect the de«rpd toowon <rf expression ot rt>e uiin g ei ife, AkemAiYdy«tbe«e)eciedpfQC!ioicr 
fiwy drive eacpressiCA of the 0etie UDckr « lifltt'i^^ 

A fistberiliiemative is thii the selected pfom^ This woakl provide 

the pottibilfiy of fAductag expiessiort of the iriK 
5 with a dieniie*! inducer. 



TimwriBtifflttl TcnmratOT 

A variety of tnnscr^cntlicnninitcinmevAnablete These 
are responsible fcr the tenmnatkm of tnnscnpdon beyond cbe tr m gfim and its conect 
polysknyiatiofi. Appropriate tntacriptftonalteiiniQaiofi ire thoee^^ 
in pUms and iiKhide the Ca^fV 355 Centtioaior, die 

temunator, the pea tbcS E9 (erminator. as wdl a* icmufuaofs natmally aisdCiaicJ with the pfanC 
proiox gene (Le* 'ptotox temiinacoff*). These cm be used in both moMx»yledon( and 
dicotyledons. 
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SegoenceA for the Enhancement of Rgpitatiort of EjprcMion 

Numerous sequences have been found to enhance gene exprtssioD from within the 
tiinscnptiQnal unit and these sequences can be uiad in oonjonCMn with the genes of dus 
invention to increase their expiession in transgenic plania. 
2 0 Various initon sequences have been shown to enhance expmssion, paiticulariy in 

monococy)edonous edla. F^example^theincronsof the maize ^ift/ gene have been fottfid to 
significiAtly enhance the expMSsion of Che wild-type gene under its cognate promoter when 
inEfoduced in(o maize cells. Intion I was found to be particularly cfToetive and enhanced 
expression in fu$icii consmtcts with the chJonfflpheaicol acetyltrinsfeiise gene (Callis er ai.« 
26^ Genes Develop^ 1: 1 183-1200 (1 9S7)K In the same eiperimemai system, the inoon from die 
maize ^TMtf/ gene had a similar effect w enhancing expression (Callu Introo 
seqi^nces have been routinely incocpoiaied into plant iianaformstiaB vecion, typically widda tbe 
. Aon*transIaied kwkr. 

A number of non«trinslated leader leqtiences derived from virusea are alAD kii^^ 
30 enhance expiessioo, and these are particularly effeetive in dicotyledoooiu cell^ Speetfically, 
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lettfersequenocsfcocnTobwCioMostte Vinu (TI^.lh6'^-«ifueooe'0.M«teaUoratkMo^ 
Vous (MCSfV). AiKi Al MoMk Vins (AMV) h»t be^ 

expression (^.^ GiUie «f aL NikL Acub Res, 15: S693-S71 1 (1987): Skuzeski ^/oiL Pteu Molfic. 
Bid. 15: 65-79(1990)) 



Twttting gf Bk Gtftc Produa Within the ftfl 

Vvioiu mechinisnu for ctffct^ 
icqv«iues conkr^iog the fiiiMtiom 
For exAmple, the urgetrnt of 8m producU to the cM^^ 
10 fouAd tt the Ansno temimil end of vtrious proieifi^ and which is cleaved (hmng chl^cipiatt 
in^ort yiddiAg the mature pmein (e^. Comai «f oL I BioL Ch^m, 26ii 15104-15109 (1988)). 
These signal sequeitces can be fused to hetcii)k>goo» 2^ 

heterologous products mto the chloroplast (van den Broeck et cL Na^yre J/i: a58«363 (1985)). 
DNA encodmg forippfopriite signiJflequenoesGUbeisoteedfmiiheS' eodof thecDNAs 

IS eacoding Ae RUBISCO prouan, the CAB procein, the EPSP synthase enzyme, the GS2 pmein 
and many other ptmeins which are known to be chlon^ast localized. 

Other gene products are localized to other organdies sach as the mitochondiion and the 
perouspmc {€.g. UngerwdL FlauMoUc. BioL J3i 41 M18 (1989)). The cDN As encoding 
these products can also be numipulated to effect the targeting of heterologous gene products to 

10 these Ofgaoelles. Examples of such sequences are the nuclear-encoded ATPases and spedf^ 
aspartate amine iransfcnte isofoms formitochofidria. Targeting to ceUularproiEin bodies has 
been desctibed by Rogers H dl. />/oc. NaiL Acad, 5cL USA S2: 6512-6516 (1985)). 

In addition sequences have been characterized which cause the targeting of gene products 
u> other cell eompaitmems. Amino terminal sequences are responsible for urgeting to (he ER, 
Che apopUft, and extracellular secieiion from aleurone cells (Koehler A Ha, Plaa C^U 2: 769- 
783 ( 1 990)). Additionally, amino lenninal le quenccs in eonjunction with carboxy tenxuaal 
lequeiKes arc rcsponsiMe for vawlar targeting of gene praducts(5hi&shi PlamMoiee. 
Aiojl /4: 357*368 (1990)). 

By the fusion of the appropriate taigetidg sequences described above to transgene 

30 sequences of intettst it is possible to direct the transgene product to any organelle or cell 
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oompttlmenL Fcirchloraplisttirgeting,fioreuiiipk,tfaB 
RUBISCO £pne* the CAB jeae, the EPSP ^ndttse 0Bne. or the on 
amino tontinalATG of the trtnagcne. llwsigDiJ sequence sdected should mdttdet^ 
deavtge »ite tnd ifae fusioQ constnicupd should ta^ 
i cleavage she which miequtrDd for cfeav^ Insomecases thUfcquhecBemmy beftiUUted^ 
tiK addition of a matt itttniber of immo 

or«XiefiiadvelyR^>SaeefMmof somamiiioackb wiifata Fosions 
coiwruciBd for ehleroplast impon cao be icaed for efli^^ 
traasiatioo of M wlfrotiwcribadcoutiuai«mfoIk)wedfay w 

10 MehiuqueK detoi^hefl hy fTUrtteft *>/ ii/^ tw! FAfrtlwi #< ^ (Eric ) ^gtty94a Q*'*^ 

MoWrulxrHiftiftgy , Elsevier. pp lOSt.1091 (1982); Wasmann aaLMoLGmL CeMT. 205; 446> 

453(1986)). T^Mecocmnjctionteclttuques are well 
I to mhochondria and petoxisomes. The choice of taring whkh may be req^ticd for cxpr^^ 
j of the tnnsgeneft wiH depend on Ae oellttlir localiiatioa of the pieciinor leqwiied as the stntiiia 
1 15 point for • Riven pathway. This will maally bccytoBolfe orcfaloroptastic, aWwugh »c may Is some 

casesbemiiochondriaJorpefoxisomal. Tlw products of vansfcne expression will not aonnally 

lequire targeting to the ER, the apc^last or the vacuole. 

The above (kscribed meehinisms for cellular targeting can be udltsed not only in 

conjunction with their cognate promoters, but also in conjunetioa with heterologous pmnom so 
20 as lo effeci a spedfjc cell urgeting goal under the tranacripcional regulaaon of a promoter which 

has an eiprtssion puxem difTetent ui thai of the piumocer from which the taigeting signal derives. 



EXAMPLE 13: TransfonnaiiooorDleotyledoiis 

Transformaiion techniques for dicotyledons are well known in the axt ind include 
Agrabacicnum-bMicd techniques and lediniques which do not require AgrobocUrium. Non- 
Airobacieruun lechniques involve the upukc of exogenous genetic material diieedy by 
pfotoplasuorcelis. This can be accomplished by PEC or eiectroporation mediated uptake, 
ptrtkle bombininieni-mediaicd delivoy. Of microinjection* Examples of these techniques ate 
^rihed by Paukowski al, EMS0J3: 2717*2722 (19S4). Potrykus cr oi.. MoL G«l Genet 
199: 'IG9-m (1985), Rekh €tcl, Bioudtnoiogy 4: lOOl-lOCH (1986), and Kldn erot, Mimrr 
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127: 70>73 (1W7). lo each CMfttbetrmsfomifid edit mfCfciNMtf 
sUadiTxl techiii(}ues kooiwn inthcul 

AypfcflctefiMW'incdiwd tniuf onnttiiMi is a jiLfuitd tfriinkpK for tAnifuniuiwii of 
dicocyledons because of its higfa effkkaey of 
difietentspociesv The many crop specmwhidiveraumeiyinn^^ 
inchide tobacco, tomato, sonflower, cotton, oilseed iiidpoplar(EP 
0 317 St I (cotton). EP 0 249 432 (comato, to Calscne)« WO g7Ar7299 (Anufiea, lo Calgene). 
US 4.795 (poplar)). 

Tnnsfonnatioii of the taiyet pUat ^locies by iccombinam noAomniiii imaUy i 
oo<ukiv«tion of Ihei^^ra^ocferqim withexpKamsfnmitte 
known in the Bit. Titftsfonncsd tissue ttrey uoaicJ on lekgt i h icTnodMmq^^ 
orbefbicide reaistanoe marlcer present between the binary plasmidT^DNA bonkrt. 

EXAMPLE 14: TnincforaiattaorMaMOtykdooi 

Tfiasformation of most monoeotyicdon species has now aiso become foutine. ViiJun i 
techniques include direct gene trmsfer imo pratopUsts usuif PEG or c tom ap u r ati cn tediniquea, 
and pafticlebombanimem into ca)Ua tissue. Transfonnaiionscanbeundenaken widiasinfle 
DNA species or muliiple DNA species oMnnsfocmaiioa) and bodv these techniques art 
suitable for use with thU invemien. Co^transfoanatien may have the advantage of avoiding 
ctm^cx vector construction and of gtneraiing vwsgenic pUats with unlinked loci for the gene of 
inteiest and the selectable martcer, enabling the fcmoval of the aelectab&e marker in subsequent 
^encrvtions, should this be legankd desirable. However, a disadvantage of the use of cth 
tratisfonruiion is the less than 100% fivquency with which separate DNA specks am ifltegzated 
into the genome (Schocher et oL Biouch^gy 4t 1093-1096 (1 986)}. 

Paunt Applicauons £P0 292 435 (lo Cibi-GeigyK EP 0 392 225 (lo Oba^gy) and 
WO 93Ar727g (to Ciba-Ceigy) describe techniques for the prepaniion of caJhta and protoj^asts 
from an ^lite inbred line of maiifit vansfocmaikm of pnMOpUsu using PEC V 
and the legcncntion of maize plants from transformed protoplasts. Conksn-Kamm er oJL* Pionr 
CtU 2: 603<6Sg (1990)) and Fromrn tt oL, Bi^chnoiogy A- 833-B39 (1990)) have published 
techniques for Qransformition of Algl-deriyed m*i» line using particle bo mNidmen t 
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fiB tfa en DOf t f ipplieatm WO 93^X7278 (to Gb^^Gcify) md Kooid «r aZ., Biatg ch ^ h gj^ II: 
194-200(1993)) desaibe teefmiques for (he tnaufonpation of ^ inbrtj lian of nMe by 
ptftidc bombtfdmcnL TTiis torfinKpJC iiriHutt ffunutotffr finiae <iHhfyot of lt5''2«5 tnm tengtft 
exmtd from t mftitt ear 24^15 d«y< tft^poDirution jbkI iPDS-lOOt^ BiobAics dcvioc for 
boiDbtntaent 

Tcansformttonof ricccattiteobcimdtertitenbydifoctgro hi'h i i i qir a yrilim^ 

pptxof^astsorpicriielebombinliTiem. Piocofilist-inDdtitedniisfonitttiaabtf bmde^^ 
Japanica^tf^ nd bidica<ypc& (Zbvg eldL* f Aw (UURep 7: 379-384 (1988); Sbinittnolo 
i^BOM iift 274-277 (1989); Dttta at Biouduiology S: 736-740 (1990)). Bodi types «re 
10 also rootindy tnnsfdnnahlc usin^paitsdc bomb«nfinBst (Chrisoau etcL Mioudmoiogy9: 9S7- 
962 (1991)). 

Pttem Application EP 0332 58 1 {tAGbk<ic\ty)iactfbcsvxtmqf^ 
timf<jmatkxkK)dxtptn^^ "Dieaetachfiiqiia allow Ac 

trmformiikmofOtietyitf and wheat Rmhernxxe. wheat innsfoniMtkn was been 
1$ Vasil cf at, Biouchutlofy 10: 667-674 (1992)) using pvtide bombardment into ceDs of cype C 
long-term icgenenble callto, a(4 also by VasU etai, BiaUchnoU^gy 1 1: 1553-1358 (1993)) and 
WcdLs er aL. Plant Fk^iaL 102: 1077-1084 (1993) using particle bontefdmcni of immatm 
embyok aad immauue embryoskrived ctlloi, A prefcmd technique for wbeaiDauformatioo* 
however, involves the tramfomiation of wheat by panicle bombaidment of immature crtAryos and 
20 includes cither a high tuerose or a h)g})malioK seep pfior to gei^debveiy^ Priorto 

bonifaardmenu tny number of embryos (a75-l mm in kngih) are plated onto MS mediam with 
3% sucme (Munshige & Skoog. Physiokfiia FUmtanoH 15: 473^97 (1962)) and 3 mgA 2,4-D 
for induction ofsomatic embryos which is allowed to proceed in the darL On the chosen day cf 
bomhardment, embryos are removed from the induction medium and placed onto the osmodeum 
35 (U. induction medium with sucrose or maJtose added at the desired coooentntkm, typically 
13%). The embryos are allowed to plasmolyse for 2*3 h and are Chen bombarded. Twenty 
emb^^os per target plate Utyjncal. although not critical* An appropriate gene^arryingplasmid 
(luch as pCZB3064 or pSC33) is precipitaicd onto micrometer size gold paitictes using stmdard 
procedures. Each plate of embryos is shoe with the Duf^nt Biolisties- helium device us^ 
pressure of -lOt^Opsi using a standanl 60 mesh screon. After bomhardiMttt. the emliryoaafe 
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. 4cedbttckiDiatb£d«fktoieeoverfor»bout2ib(idQattQt^^ Aikr24ha.tbe 
cmfafyos tre itnDOVul Iiimii Iho otnioi icmn md ptsat back ooto ift An ^ nm mffrfiirm wbex tfuy 
$tMy for About 1 montb before Rfcfte ttk)ii« ApproxinMtefy month Uotf the ortbiyo explaitt 
wife devdopiiif m bfyoggite callus imtiiittfe^ m^m 
N AAi 5 n^iter GA). lizrthtf coiitai]|mg ttie 

ofpCIB3064«Kl2ro^mdhQtreiBtlfrmthcc^ After Mp p< ox hn«ely ope aoolto, 

developed ibooc$ tre tmsfoted to ijr^ 

biIf-tt«tigthMS.2%iuc7W;aiidttteiim Pitem ^pplicMkn 

Oe/147,161 dcscribo inethodt for iiw^ 



EXAMPLE 15: 



A Lttiihdft Zip n genomk DN A libniy |»^^ 
whole plant) was pitfchased from Si^itaffette. Approxitnatdy 125,000 pbafe wm plated tta 
dentin or25«000pAi per t5c»Fc»t di«h and dq>Koaie lite were made w 
Scfwi membrane* (NENDupont). Theplaquelifis weic probed with the ArabidopiuPm^ 
cDNA(SEQZDNo. ) labeled with pP-dCT? by the raikdoin pmnii^ 
Tcchnoiogiea). Hybridisadoo and Wash eoiiditiom were at (^^Cas described 
Gilfaou Proc NaiL Aad. ScL 199M993 (1984). Positively hybddimg pbitoes 

purified aM in vivo e<ciied inio psiiBSCTipi ^as^^ Setjueiice fram the geflonuc IWA iBkttta 
wai determined by the chain tefltunitioo nedxid luiAg dideoxy tcrmiMion labeled with 
flttoreaceni dyci (Applied Biosyftetm. locX Oocclone« AraPTlPi^ wasdetemunedtocmtaia 
580 bp of Anbidopais sequence upctream from the initiatiiig methionine (ATG) of the PratOK-t 
protein coding leqimce. This ctom t also contains coding scqucfiee and mtmu that extend to bp 
1241 of the PttHox-t cDNA sequctise. The 580 bp 5' noncoding fragment is the putative 
Anbidopsit Ptotoa*l pimioier, and the sequme is set forth in SEQ ID No. 13. 

AntPTlFto wu depoaited Deecnber \4, 1995. u pWDC-l ] (NRKL «B-2l5t5) 
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EXAMPLE !«; 



Afull-leii|thcI>NAofih6«pprQpriiieahm ciMA is bolited as 

A EeotU-XhoI pirtUJ digest fngmem and clono^ 

pCGNl761ENX (m Exiii)|»le9 oflmeiitttioful ^ieition no. PCryiB95A(M52 lUed June 8, 
1995.|nbliMDee.2M995tsW095/34dS9X This plasntidbdi^ttd with NcoZ«ndBi^ 
topioduoBafra^fiieAieotopriscsdof thftcoiivte cDNAptesttiutscnpaonieniiiniiar 
imthcS'ttntmnsUiedsequeticeofthetmJsmofA^ Hie 
AnFTlta plismid desaibed «bm is digest 

^M^riscdof pB>iiescripcindihe5SDbppiiudv«Anb^ LiprioAoC 
these two hsgmcitts produees a fusion of the tltrnd pnitox cDNA to dK native pf««o« promoter. 
The expnession Ottiette ooiitainiAg the Pmcox-l pfomoco/PMox-l cDNAAm] terauimrfbsioo 
uexciied by digestioa with 1^ and cloned incothebiititty ^«etorpClH2UU. Thebiftsyplisnud 
is nantf ontted hy dectmponbon into Agro ba cto i um and then into Anbidopsts using the vacuum 
iflfihntiaft method (Becbcold et aL CR~ Acai Set Paris 316: 1 194-1 199 (1993). Tcaasfarniana 
expi«ising altei«d protox genes aie selected on kanamycin or oa various conoeotrations of pratox 
inhibitiikfi hertrictde^ 



30 



EXAMPLE 17: Pttdttctkm of bertkidetolenni plants by 
PivlM-l prtMDolet/alicfed Pm1ois*1 fkufioB 



ofanalive 



Using the pcoooduit described above, an Arabidopsis Pioiox* 1 cDNA containing aTAC 
to ATG (Tyrosine to Methionine) change ai nucleotides 1306-1308 in E^rOtoa-1 sequence 
(SEQ ID NO'l) was fused u» the native Piotoi-I promoter fragrneot and trnisforaied into 
Arabidopsis ihalinfia. This altered Pnxoi-I enzyme (AraC-2Met) has been shown tobe >lOfold 
nvMv coterant to varioas pcDtox-inhtbidAg herbicides than the natunlly accumng enstyme when 
tested in a bacterial expitssion system (see EaampJes 5-9). Seed from the vicauffl infiltmed 
:\o plami was collecied and plated 00 a fipgc(iaOnM-LOuM) of a piocoi inhibitory afyhi^ 
herbicide of ronrnilaXVn. Multiple espcriTPema with wild type Arabidopsis have shown that a 
lO.OnM eooccntration of this compound is sufficient to pievencnorroaJ seedling gcnninatkio. 
Tniug^ic seeds expressing the Ar>C*2Mci altered enzyme fused lo the native Procox-I 

4S 



» 

> 

i 

I 
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p. jatatptoiueedm>r(OtiAntAdapni$eedla^ 

Hut iMvaod/ftltefQd {miam ^axyme liitico thcudbfB AinciiOBt m si effective seloctiblc mvkef 
fiorplBiMttteiMtioa. Scimlof ihepliftt»thMfemmMdo0 1(X).0dM 
tmitptamfid to JoiL fmni 2-3 wedu, ai^ 

ofcbt pnatDx^uihibitUkcbetbicidc. Whcp coum wil to capQf ixoorcootgol 



j Virioot faodSfkatiou of the kmot^ 

•kiltedmihe in SucbmodifkiitkwMimeiidedtofAil wi^ 




'03if07|l4g(^)14l|l0^ %5^:B S K B 




(1) fiBIBML ZVfCiaUTZGIV: 



Iterd, Brie x. 

(ii> txtLB OF nraamcaf: fiwt >iOMftrnr.F*> emcddims fuust 

ntOTOPORPRrRXtlOQBI 0XZ13ASE AW XJ^BZTCR^RtSXmvr 



(ill) 



OF 



: 13 



(iv) CORRBSPOfOTBICX AnOKISS; 

(A» ClbB-Osiffy Corpo»tioa / IMWIt Dipt. 

it] STREET; 540 lAlit« PImLdm Rd. 

(C) CZ?V: TftrrytcM 
<D) STWl: IT 

IF) SXF: 10S91»900S 

<v> cww r cii mlaTtf^Hf.K form: 

(X) N^ZUM lYPft: FlopOy di»k 

<B) COHPUTER; XBC PQ CCWL&i^A 

fCl OPBRATXW Srsm: FC-DOS/KS-DOS 

(D) SOPTWM; FACMtXn MImM II. 0, Vmion fl.25 

(Vt) CURROT AFPLZaiTIOIf DATA: 

(A1 APPX^ZGATZQII NQXBER: US TBA 

(CJ CIA5SZF1CATI0M: 

(vil) PRIOR APFLICATIOli MTA 

(A) APFLZCATXON NlMBBk: OS 08/261,198 
(0) rXLlMQ DUTE: lH^JTlH^Bi 



(vlil) ATTORIZy/AfiDTr IMFOMATZOVi 

(A) HANS: BlMr. J«m0 ScotC 

(B) m&zsTWtiw uomsit 3«,U9 

<C) RCrOl£liCS/D0CKST VCNBSRr COC ia47/prov 

(IX) meCOHkUNXCATTGN I]IF«RKATX<»; 
<A» TCLCPHOMfi: 919*.$41-aSX4 
(B» TCLe^AX: 919-541-9609 
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ix±) sKoxE nesoamw: siq to noa: 

TGACMUUTT CCCJUTTCtt tCOGAITrCC ATO OMS TTA TCT CTT CTC CCT CCO 54 

Vet Glu Liu 9«r IM lM|u Ar9 Pro 
1 5 

XCC ACT CJU TCG CTT CTT CCC TCC TTT TCG MO OCC AAT CTC CGk TTA 102 
Ttir fhf Gin Ser xm Pro smr nm fl«r Lyv Pvo An Azv 
10 1$ 30 

AAt GTT TAT AAfi OCT CTT AOA CTC »t TCJT 1» «TO OCC CGT OOA CCA ISO 

Am VaI Tyr ly* Pro Im Arg b«ki An cy« Mr V«l Ala Gly eiy Pro 
2S 30 35 40 

ACC CTC GOk TCT TCA AAA ATC (»A flOC OGA OOA QOC ACC ACC ATC ACG 19B 
rht Val Gly 8«r S«r l#y» xl« Glu Gly Gly Gly Gly 1%ir Thr XU Ar 

50 5S 



ACQ GAT TCT CTC ATT CTC GCC OGA OCT ATT ACT OOT CTT TOC A3^ OCT 2«i 

nu- Ajip Cy« vaI II* oly sly Gly II* sw Gly Uu cys ri« Ala 
60 65 70 

CAC oca CTT GOT ACT AAG CAT CCT GAT GCT OCT CCG AAT TTA ATT GTO 294 
G1& Ala Uau Ala Xrya ilia Pro A>p Ala Ala Pro Asn L*u Xl« V#l 

75 80 as 

ACC GAG GCT AAG GAT COT GTT OGA GGC AAC ATT ATC ACT CGT GAA GAG )42 
Thr Glu Ala Lya Aap Arg Val Gly Gly Aan Zl* ZU Tbx Arg Glu 

90 »5 100 

AATOCTTTTCTCTOCCSAAGAAGCTCCCAATAOTTPPCAACOGTCTCAT 330 
Aan Gly bau Trp Glu Clu Gly Pro Aan Sar Ptoa Gin Pro sar Aap 

lOS 110 XIS 120 

CCT A7C CTC ACT ATG GTG OTA OAT ACT COT TTG AAG OAT GAT TXC GTG 430 
P^o M«t t«u Thr Mat Val Val Aap Sar Oly Lau Lya Aap A«p Lau Val 

12S 110 135 

TTG OGA OAT OCT ACT CCG CCA AOS TTT GTC TTC TOG AAT GOG AAA TTG 4B€ 
Lau Gly Aap Pro Thr Ala Pro Arg Pha V«l x^u Trp Aan Gly Lya Lau 
140 li5 ISO 

AOG CCC GTT CCA TOG AAG CTA ACA GAC TTA CCO TTC TTT GAT TTC ATG 534 
Arg Pro Val Pro Sar Lya Uu Thr Aap Lau Pro Pho Pha Aap Lau Mat 

155 160 %$S 

ACT ATT OCT CCG AAG ATT AOA OCT GOT TTT GGT OCA err OQC ATT OGA 562 
Sar lia Cly Gly Lya Ila Arg Ala Gly Pha Gly Ala Lau Gly Il« Arg 

170 175 180 

CCGTCACCrCCAOOTCOTGAAGAATCTGTGGMSGAGTTTGTACOQCGT 630 
Pro Sar Pro Pro Gly Arg clu Glu Sar val Glu Glu Pha Val Arg' Arg 
185 190 ISS 200 

AAC CTC OCT GAT GAC CTT TTT CAC CCC CTC An GAA CCC TTT TOT TCA 
Aan Lau Gly A«p clu Val Pha Clu Arg Lau Zla Clu Pro Pha Cya sar 
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R:624 



P. 53 



205 



210 



215 




aeST GTT TAT OCT «r GXT CC7 TC» AJkA CTO A9C An# JUU OCA C» n? 72« 
Gly VAl Tyr Ala Gly A«p Pro ty» 1*^ l0f« Al* Al* Ph« 

320 225 230 

006 AAG 9tT toe AAA CTA GAC CM AAT OGT OnA AOC ATA ATA OCT OOT T74 
Gly Ly« vaI Trp Lya Olu Gin AMI Gly Gly $mr Il« lit Cly Gly 
235 340 2«5 

ACTTTrAA6CCAATTCAGaACAOCAMAJkCO«CCCAAtfOCAG*ACGA 022 
thr Vtm IM Ala tlm Gin Glu Arg Ly* A«& aU Pro Lyv Ala Olu A^ 
250 355 260 

GAC CCC eCC CTC CCA AAA CCA C*0 OOC CXA ACA CTTT GOT ICT TTC AOS $70 

A»p Pr« Arv ftf a ty» Pro Gl» Cly cln ttir Val Gly s^- Ph* Arj 
265 370 275 2*0 

AAfi OGA err CGA ATG tTG CCA GAA GCA ATA «t OCA AC» TTA OCT *0C 9H 
Ly» Gly t«u A«v Mec tittu Pro Gly Ala tl* 6«r Ala Arg Gly 
285 250 25S 

AAACrPAACTTOTCrTOGAACCTCTCAGOrATCACTAMCTGGftCAOC 944 
Lya V*l hym Umx Bmv Tzv Lym pmx fiar Cly Ha tlir I.#u Glu S«* 
300 305 310 

OGA OGA TaC AAC TTA ACA TAT OM? ACT CCA OAT OCT TTA CTT TCC OTC 1014 
Cly Gly Tyr Aan Lau Thr Tyr oLu Tbr Pro Aap Cly Lau Val Smx Val 
^ 315 350 325 

CW5AGCAAAA£7PGTTGTAATOA£5SCTGCCATCTCATGrTOCAAGTOOr 1042 
Cln 5«r Ly« Act Val v*l Hat Thr VaI Pro Hla Val Ala s«r cly 
320 335 Ut 

CTCTtOOOCCCTCTTTCTGAATCTOCTOCAAATOCACTCTCAAAACTA 1110 
Lou &«u Arc Pr6 t«U sor Glu Sar Ala Ala Aan Ala L#u $mr l>ym Lau 
345 350 355 340 

TAT TAC CCA CCA CTf GCA OC* GTA TCT ATC TCC TJUt CCG AAA GAA OCA 1158 
Tyr Tvr Pro Pro V*l Ala Ala Val Jla Sar Tyr Pro Ly« Glu AlA - 
365 370 375 

ATC CGA ACA GJLA TCT TTC ATA GAT OCT GXA CTTA A*C OCT m OGC CAA 1^04 
lU Arg thr Glu Cya Lau Ila A«p Gly Glu Leu Lya Gly Pkio Gly Gin 
380 305 350 

TTG CAT CCA COC ACG CAA QG* OTT GAA ACA TTA OCA ACT ATC TAC ACC 12S4 

Lau Hia Pro Ara Thr Gin Gly Val clu Thr Uu Cly Thr Ila tyr sar 
395 400 405 

TCC tCA CTC TTT CCA AAT CCCGCACCOCCCGGAAGAATTTTaCtCTTG 1302 
Sar Sar Lau Ptia Pro A<n Arg Ala Pro Pro Gly Arg Zlo t^u Lau Im 
410 415 430 

AACTACATTOOCQOCTCTACAAACACCOGAATTCIGTCCAAOTCT GAA 1350 
Aan Tyr Xl* Cly Cly Sor Tftr Aan Thr Gly llo Lau Sar Lya Sor Glu 
425 430 435 440 

GOT GAG TTA. CTC GAA QCA GTT GAC ACA CAT TTG AOG AAA ATG CTA ATT 1398 

Cly Clu Lau val clu Ala v«l Aap Arg Aap Lau Arg Lya Mac Lou 4la 
. 445 4S0 455 

AAG CCr AAT TCG ACC CAT CCA CTT AAA TTA OCA G^ AGO CTTA TOG CCT 1444 
Lya Pro Asn Sar Thr Aap »ro Lau Lya LOU Cly Vol Arg val Try Pro 
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460 



465 



4-0 



CXX OCC ATT CCT CAfi TIT CTA <7fT GOT cac TTT CUT AlC CTT QabC AOC 
OXn AU Xl« fro Gin «ba L«u Val Qly Uia Am A*» lU Lm A«P ttar 
47S 400 485 

OCT AAA TCA tCT C9A ACS TCT TCQ aOC TAC GAA 000 CTA TTT TTI# OOt 
AlA 1^ M s«r Lmi Ttor ter S*r Oly Tyr Glu Gly IM Pte L«u Qly 
490 495 500 

OOC AAT TACCTCCacrOCTOTAOOCrrAOGCCaGTTTr OTA- GAA GDC OCA 
Qly Am Tyr v«i A14 Gly VaI AIa L«i Gly Azv Cy» vaI gIu Gly aU 
905 510 515 530 

TAT CAA ACC OOC ATT GAO QIC AAC AAC TTC ATQ TCA COG TAC OCT TMS 
Tyr tilu Th^ AlA Il« Olu Val Am Asn Pte mt scr Arg Tyr AU Tyr 

525 530 535 

AAC TAAATC7CAAA ACATTAAAIC TOCCAOCTTG OGTCMffTTT AtTAAATATT 



rrGACAXATC CAAAAAAAAA AAAAAAAA 



i7] IVfWOmm f(» 8B0 X9 not): 

(i) smsoBKZ auRAcnomcsj 

(A) iMWi 537 AaiM ACids 
CB) TYPB: AJAino acIO 
(0) TOFOLOGITi IIhaaT 

(ilT MGLBCCLS rm; psrotAla 

(xi) StQODCt DSSOUmOH: SBQ ZD HO:): 

llAt Glu Lau Sat Lau Arv Pro TSir Tte G1& Sac Lm L*tt Pro SAT 

15 10 15 

PhA s«r LyA rro Am lau a^v ^au Aa» v«1 Tyr Ly« Pro Lau Arg Lau 

20 25 30 

Arg Cya SAr VaI AIa Gly Gly Fro Thr VaI Gly Sat Sat Ly« ZIa Glu 
35 40 45 

Gly Gly Gly Gly Tbr Tttf* tlA Thr Tlir AAp Cy« VaI Zl« VaI Gly Gly 
SO 55 €0 

Gly IlA SAr Gly Lau Cy* llr aIa Gin aIa Lau AIa Thr LyA Hii Pto 

65 70 75 eo 

A^ Al« AIa Pro aah Lau z1« VaI Thr Glu Al« LyA A«p Arg VaI Gly 

85 »0 95 

Gly Aah Ha XIa Thr Arg Glu Olu A«n Gly PhA L«u TTP Glu Glu Gly 
100 105 110 

Pro AAfi SAr PhA Gin Pro Sat AAp Pro imtx Ttix Mot vaI VaI Acp 

lis 120 125 

0Ar Gly LAU LyA A«p AAp Lau VaI Lau Gly AAp Pro Thr aIa Pro Arg 
130 135 140 

PhA vaI LAU Tsp Aah Gly <.yBLAUAxgProVAlPro5ArLyaLAuTte 



1494 



1542 



1590 



1638 



1491 



1719 
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c>uatL. 



I r 



14S 



ISO 



155 



ICO 



Asp L«u ^ Hm M MP Um ttoe atr II* Oly Oly Lyi Xl« Azy JOa 

1«5 170 in 

Cly Fba Gly AlA UrU Oly Zl« Arg Pro s«r m Pro Gly Azy olu 9lu 
180 IBS 190 

Smt V*l Glu Qlu Ph» V»l Arg Aro Aaa Lm Gly Mp Clu Via 91m 61« 

195 300 a05 

Arg tivu Il« Glu Pro fKm Cy« S*r oly V*l Tyr Al* Gly Mp Pra s«r 

310 215 230 

l4r« IM 8«r ty« AU AlA PhA Gly VaI Tt» Ly» liw Glu Gin 
325 330 235 240 

Am Gly Oly S«r II* II* Cly Gly Ttar PIM Lya Al* 11« Gljl Oltt Azv 
245 250 2SS 

Iw AM AlA Pro tjffm AU Glu Ary Aap Pro Acy Pre Lyo Pro CId 
240 US 370 

Gly GlA Tbr V«l Gly S«r Pb* Arg Lys Gly IM Arg nut L«u Pro Glu 
275 340 3IS 



AU Il» S«r Ala Arg Lm» Gly Sux Lym VmX LV* 

3»0 « 395 200 



ftp Lya L«u 



S«r Qly ila nir lyv L*ii Glu fl*r Gly Cly ryr Am L«u rhr Tyr Glw 
305 310 315 330 

Ttur Pro A«p Gly Umi v«l 5«r V«l Gin s«r i,ym Smt VaI v«1 noc Ttir 
335 330 33S 

V«l Pro S«r Hi« V«l Ala Smt Oly Lmi Lm Arg Pro S«r Glu Smx 
340 345 350 

Ala AU Am Ala tau Smt Lya Lau Tyr Tyr Pro Pro Val Ala Ala Val 

355 3C0 )£S 

Sar Zla Bmr Tyr Pro lys Glu Ala lla Ary Thr Olu Cya tau Zla Aap 

370 375 3«0 

Cly Glu Lyv Gly Pha Cly Gla Lau Hla Pro Arg Thr GIa Gly Val 
385 390 395 400 

clu Thr UfU Gly Thr lla TVr sar sar Sar Lau Pba Pro Aan Arv Ala 
405 410 415 

Pro Pro Gly Arg zla Lau Lau Im Aan Tyr Zla Gly Gly Sar Thr Aati 
430 435 430 

Thr Cly lla Lau Sar Lya Sar Glu Gly Glu Lau val Glu Ala Aap 
435 440 445 

Arg Aap Lau Arg Lya Mac Lau Xla Lya pro Aan sar Thr Aap Prv Lau 

.450 455 440 

Lyi Lau Gly Val Arg Val Tvp Pro Gin Ala lla Pro Oln Pha tau Val 
445 470 475 480 

Gly Hlo Pha ASP Zla Lau Asp Thr Ala Lyo Sar sar im Thr Sar Sar 
445 490 499 
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P, 55 




p. 56 




Oly ivr Olu Qly Lm FlM tM Oly Qly AM tyr VAl m Ay v«l 
SCO 505 510 

Oly ATv Cyw v«l olu oly Ma vy^ Olu AXa X1« Olu Ml 
515 520 525 

AaPh«iMS«rArgTyrAlATyrty* 
530 535 



(1) mfQUMAROil rat 9BQ ID 90:3: 
(11 



(X) UMTHi 1730 bAM pftin 
(B) TYPE: aiicl«ie acid 
IC) mANXOOnsS: tiJigl* 
(0) TOraMT: llMtf 



(11) MOIbkCdU TTPS: 
(lV> AmX-SBV: W 



(X) lOUtt/KZy: CtMt 

(HI LOCXnOK: 70..1SS0 

ro) OTBSR ZMFOIWmcil: /fM«#« 'AzvbldcVMiA P«otOT-2 

" znoe-i" 



(Ix) 



(Xl) SlOmCS DSSCKtmOV: S*Q ID in:5t 

ii'isTTJkcrr xtttcoci*cx ctqc^ttcqx cTmtfTfcflAQ xirrvMMiTC rauviuviXtt 

OUUTMaCX XTQOOOTCTCaaXOCXOTAOCXCUTCXTCAXXnGMGCO 
Mat XU fier Oly Xl« VAl XlA X«» Sis 61a Xl» Gly XIa 
15 10 



60 
10^ 



15 



30 



30 



35 



40 



50 



$5 



(5 



70 



to 



1$ 



95 



100 



OOP 
Oly 
25 


OCX 
VaI 


Mr 


OCX 

Gly 


CTT 
IM 


15C 


XXT 

xim 


Val 


ACT 

Tto 


C7TC 
V«l 


T« 
45 


304 


wr 
S*r 


VAl 


XTQ 
M4C' 


cxx 

Gla 
CO 


xxr 

AM 


353 


xcr 


OM} 
Glu 


OCT 

XlA 

75 


Glu 


OCA 


300 


COT 
Xry 


oxs 

Glu 

to 


XXX 

Ly» 


CXX 
Ola 


cxx 

Gin 


340 


000 

xr« 

109 


xxr 

XM 


OOT 

cay 


orx 
vml 


OCT 

tX9 


W 



S2 
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Val Mt LAV Pro Thr Mm 1^ H 

1X0 115 



130 



US 



160 



1«5 



175 



Hat Ly« Hi4 9«r 

IflO 



180 



185 



210 



225 



240 



245 



255 



2ft0 



27 Q 



275 



290 



305 



320 



325 



535 



340 
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Cite 


era 


cwc 
Val 

120 


ACA Adt A9r OfO CTC 
nir S«r Mr VAl X^ra 
125 


444 




ttc» 

135 


OAA 
Olu 


CCA TIT rtX TOO am; 
Pro Vte trp Ly» 
140 


492 


OCA 
Al« 

ISO 


TCT 

s«r 


OCT 
AU 


OAA GAA A0t OTA AOC 
Olu Glu 8«r VAl 9«r 

155 


540 


CAA 

GlA 


<Slu 


OTT 
V«l 


Orr GAC TAT CTC ATC 
V«l Asp Tyr Zl« 

170 


588 


OCT 
Alft 


COS 
Ala 




CCt OAT tec CW tCA 
?ro A«p Smt Smt 

las 


(34 


TOS 


AAT 
Atn 


OTA 
VAl 
200 


CAO AAA A9r WT OOC 

Glu Lya ter eiy 
205 


«M 


ACA 


AM 

215 


TTT 
Pb« 


OCT OCT AAA GOT nt 

AU AU t-y* Gly Gly 

220 




OCT 
Pro 
230 


OOC 
Oly 


ACA 

Ttar 


AAA AAC OCT TOG OCT 
Ly« Lyv Gly S«r Ax9 

335 


780 


ATS 


CAG 
6lji 


AW 
lU 


CW CCT GAT ACC TTQ 

Uu Pro Afp tbr Uu 

250 


629 


ATC 
Xl« 


AAT 
AU 


TTA 
IM 


OAC TCC AJkO OTA CTC 
A«p Smx Ly8 V«l Umi 
255 




AGA 
Arg 


CAO 
Cln 


Glu 
280 


AAC TOO TCA TTA TCT 
A«n Txp 8mt L«U Smr 
285 


924 


JkBA 
ATfl 


CAA 
Gin 
295 


AAC 
Am 


CCC CAT TAT OAT CCT 
Pro Kifl Tyr Avp AI4 

300 


972 


AAT 

A0R 

310 


Gits 


AAC 

x-y» 


OAS ATO AAC CW ATTS 
Olu Mt Ly» VAl Mt 
315 


1020 


AAC 
Au 


TTT 


etc 


CCC GAO AW AAT TAC 
Pre Clu rl« am Tyr 
330 


1044 


ACA 
Thr 


TTC 
Mm 


ACA 

Thr 


AAC GAG AAA CtA AAC 
Ly* Glu Lyv V*l Lya 
345 


1116 


etc 


AW 
XI* 


CCA 

Pro 

3(0 


TCT AAA GAO CAA AAO 
a*r Lyn olu Ola Lyv 
3CS 


1144 




CM OC?r TTC AAA ACT CTA OCT AO^ CTT TTT TCA 
HlB Gly Ph» Lya ttor L«u Gly tttf Mi Phm £w 
370 375 

OIkT COT TCC CCT ACt OM: on CAT CTA filT ACA 
Atttt An Mr Pro S«r A*p v«l Hi* tmu Tyr "Tht 
305 390 

AOTAOGAACCMCGAACTAOCCAAAOCTTCC ACT 
Smt AdTff AMI GXn Glu L«u Al* tya Ala S«r Thr 
400 405 

OTTOlCACTTCTaACCTTCAacGACTCTrOaOO 
V«X V«a Tte Mr Asp Zitu Gin Asg l«U Gly 
41S 430 

<7rC TCT C7TC AXC CAT TAC TAT TOO AGO AAA OCA 
Vml Am His Tyr Tyr Txp Arv l.ys Al« 

430 435 440 

AOC AOe TAT GJtf TCA ore Aie OAA OCA AIT GAC 
8«r m Tyr Aq> S«r VAl Hat Qla Alt Xl« A«p 
450 455 

CTA OCT OOC TPC TO TXT OCA OCS? AAT CAT COA 

Uu Pre Gly Mit Phi TVr Al« Cly A<& Kit Arg 

463 470 

GOG AAA TCA ATA OCA TCA OCT TQC AAA OCA OCT 
Gly Vyw S«r Zl* AlA Mr Gly Cys Ly» Alft Ala 
480 405 

TAC CTC CAG TCT TOC TCA AAt GAC AAG AAA CCA 

Tyr L^u Glu Mr Cys S«r A4n AJtp tya l4r> Pro 
495 500 

AAooncoTC: cctttttatc AcrrAcnro taaacttcta 
ogattagcca acaactcaoc aaaacccaoa ttctcataag 

ACTATtt ATC TAAAA 



TCA AtG ATG nr OCA U3Jt 
Smt IMC Mt Wtm Pro 
380 

ACT TtT ATT OCTT 000 1360 

Thr Ph« Zl* Gly Oly 

395 

OKC GAA TTA AAA CAA 1300 
A*p Glu V&M Lyfl Ola 
410 

OTT OAA OOr GAA COC 1356 

Vil Olv Oly Glu Fro 

435 

nC COB ttO TAT OAC 1404 
»ro L«u tyr Asp 
445 

AIU3 ATG GAG AAT OAT 1453 
Lym *J^t Glu An Aip 
440 

OOG OOG etc TCT OTT 1500 
Oly Qly UU 9ir VU 

475 

OAC err OIO ATC TCA 1544 
Aflp L«u V4l tlm tar 
4d0 



AAT GAC AOC TTA TAACATIGTC 

AM AfP S«r Zivu 

505 



1403 



AAAlOaUhCA AOOOOCOOtG 
QCICACTAAT TCCAGAA1AA 



1443 
1733 
I73fl 



ZNFOnUTXGH PGA SCO 10 >IOi4$ 

at 5S:0CJOICE OCARACTexXfTXCS; 

CA) IfNCTHt 508 Amino «ci4a 

CB) TTPe: UllftO «cid 
<DJ TOPOLOGY: llMAr 

(Li) NOLXCULC TTPEr protein 

Ixi) fttOUDCI &£SC1UmON: SBQ ID 1^:4: 

Mat Al« tar Gly Al« Vml Al> A«p Hip Gin Zla Glu AIa VAl Smt Oly 

15 10 15 

Lv« Arg V«l AU V«l VaI Gly Ala Gly V*l tar Gly L«u Al« AU aIa 

30 35 30 

Tyr £.y« Uu ty* tar Artf Gly Uu A«t V*l Thr Wml Pttf Glu AIa Asp 
35 40 45 
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Cly Xrfl V»l Cly Cly ty« Uu ATB S» vml ll*t Ol» A«fi flly Umi llm 
50 55 ^ 

tzp hmp Olu Cly Al« An tkr ll«t olu Ala Olu Pxo aXu v»l Gly 
70 7$ 80 

S«r Uu Uu A«p A«p Uu Gly L«i Arg Glu X*y« GlA Gla PlM pro Zl« 
05 90 »S 

S«r Ola Cor« Ly« Ar« TVr Zl« VAl Arg Am Qly VaI Pxo VaI ll»t Lm 

100 105 no 

Pro Thr A«a Pro XI« Glu Vtl Tlur Smx s«r VaI Im ittr *Ar Qln 
lis UO 125 

CwZoninMeaflZlaZMlMGIuPt^FtMEMttptMlyalMMr 
1^0 135 140 

fi«r Ly« V«l ter A«p A3l« Mr AU Giti Ola ter v«l S«r Olu Pb« vto 
1« 150 15S ItfO 

Ola Azy Bit »M gly OlA Glu val Vttl A^ tyx IM 21« Aip Pro Mm 

1«5 170 175 

VaI Gly Oly 'Rir 8«r AlA AU A«p Pro A«p L«u Mr Miie Lyv flia 

l&O 1B5 1»0 

S«e Ph« Pro A«p l*u Am v*l Glu M Pte Gly fl*r 11« Zl« 

195 300 aos 

V4l Oly AU Il« Ar« Tfar Ly« Ph« Ala AU Ly« Oly Gly S«r Arg 

210 ais iao 

A*p Tbr Ly» smr smr Pro Cly Tbr tyv Ly* Gly s«r Ar« Gly S«r P^ 
225 250 335 340 

S«r Pb* ly« Gly Gly Mt Gin lU X«u Pro Atp Tfar Lai Cys S«r 
349 350 3SS 

Xiou Sor Kl« Asp Glu Xltt Am L«u Asp 8«r Ly« Val Uu $«e £m S«r 

240 34S 370 

TVr A«n S«r Gly 5«r Azv Oln Glu AAA Trp S«f LeU 5«r Cyt V«l Sar 
375 340 aS5 

Ml« A«i Glu Thr Gin Arg Gin A«i Pro Him Tyr Asp Al* V«l tlo 
390 295 300 

Thr Al« Pro Lou Cyo Ami Vol Xyo Glu Moc tya Vol Moc Ly« Gly Gly 

305 310 315 ' ' 530 

Gin Pro Phm Gin Lau Am pho Lou Pro Glu ilo Aon vyr Not Pro Lou 
335 330 33S 

Sot V«l Lou Ilo Itir Thr Ph* Tlir Lyo Olu I^o Vol Keg Pro LOu 
340 345 350 

Glu Oly Ph« Gly Vol *mx Xlo Pro Mr Lya Glu Oln tyo ttio Gly 91m 

3SS >4Q 34S 

Ly« thr Lvu Gly TAr Lou Pho Sot Sor Hoc Mot Whm Pro An An 9or 
370 375 310 " . 
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400 


1^ ttla Vml 


Vml Hkor 
415 


Olu Vro V«l 
430 




Tyr A«p ter 
445 


9mr TfT 


A«a A«P L«u 

4fi0 


Pzx> Gly 


s«r vml oly 


460 


II* 9«r )Vr 


495 




CAf HtM/WXti CDS 

(ft) tOCATZCSM; 3,, 1453 

eull-l«naUi) f — <ya f) C# tnm tMX:-4' 

5EQ0B>Ce OESOUPrXON: SEQ td ItOsS: 
CAATlCGOCQGJUITOeCtCmmGOCOGAacCATCACTOQCCtC 

Xan Sor Ja« kmp Cyo V4l VaI VaI Gly Cly Gly Zlo S«r Gly L«u 

15 10 15 

•roe ACC OCG CACOOQCTaocCACOOQOCACaaeOTCOOCaACmCTT 

cya nr ai« «1a ai« aIa Tfir Art Hi* aiy vaI ciy Aap vaI Vmi 
20 3S 30 

ore Aco luo oec coc occ ooe eoc ooc ooc aac at? acc aoc otc cAa 

Vftl mr Glu JUa Ar9 Al4 A^ yro Gly Gly Aaa ll* Thr VaI Clu 
5» 40 45 

ooc CCC GAG GAA GOO tAC CTC tOQ OMl QJU3 GOT. COC MC AGC TTC GAG 

Ar« Pro QU Oly Tyr Lm Trp GU Glu Oly fro A«a 8«r Ph« Gla 

SO s$ CO 






pr« 8«r MP 9rQ VaI Hur Itftc JOA vmX- iU» S«r GXy L«i tyv Aap 

6S 90 75 

QMT TTC tTTf ITT OOS QAC CCA AAC OCG COCr COT ^nc CTHS CTO T03 OMl 

Affp IM Vid Ph« Oly A«p Pro An hXm Pro atv Mm v»1 Xmx rrp oiu 
»o as 90 95 

oly tys XM Arg Pro v«a Pco 8«r Pvo M« A«p tmi Pro Ite Mm 

100 lOS UO 

OATCrCATGSiUXATCCCAOCKMGCtCJMQGOCCOOTCrXCQCC^ 
Afp Zm Mat Sar XI« Pro Gly I^y* Xm Ax« AU 0Xy Oly AXs Uu 
1X5 120 12S 

0CXA1C0CKC08CCtCCTCCAaXC«0AAeM9CA9C6<MCMfTC 
Cly Xl« Ar9 Pro Pro Pr« Pro Cly Arg Clu Glu S«r V4I (&lu Olu Pbt 
UO 13$ 140 

eTGOOCCOCAACCtCOCffOCTMmTTtaUidXCfCAtTaMCeT 
V«l Aro Arg A«a Mu Qly Ala Clu V«l rtm slu Arg L«u I1« Clu Pro 
14S ISO 155 

TK;ilXTCAO0TGTCTATOCrO0TCMOeTTCTAACCXCAiaCAT6AlUS 
Pb« Cyv S«r Gly Tyr Al« Gly A«p Pro S^r Liy« Uu S*r M«t 
1$0 165 170 175 

OCTOGJimOOGAACGTrTCCCGGtlOCAACaAAACtOaAGerACrAfT 

AU Ala Pb» Gly v«l Ttp Arg t«u Glu Glu Thr Gly Gly M U« 
ISO 185 190 

AnO^GOAACCAlCAJtGACAArrcAOGAGAaGAOCAAOAATCCAAAA 
ti* Gly cly Thr Xl« iyt llMf !!• Gin Glu Arg S«r tyo A*a Pro &y» 

195 300 ;tos 

CCACOOAOSGATGCCCOCCrrCOGAACCCAAAAGOQCAOACAGnOCA 
Pro Pro Arg Asp Al* Arg lmu Pro Pro lyg Gly Gla Itir VaI AXk 
310 215 2)0 

TCrnCAOGAAGGOTCTTOCCATGCrrCCA AAt COC ATf ACA TCC M£ 

S«r Pho Arg Lyo Gly Uu AlA Noc Pro Am Ala xio ttir ««r 8«r 
235 130 23$ 

rnSOGtACtAAAGtCAAAefATCATOgAAACTCACGAOCATrACAAAA 

l«u Gly Mr Lyg Val tyo Uu fl«r txp Uru tbr s«r zi« Hur Lyt 
340 24S 250 255 

TCA CAT OAC AAG OGA TAT CTT tTG GAG TAT GAA ACQ CCA GAA OOC GTT 

Sor A«p A«p Ly* Cly Tyr v*l Uu Glu Tyr Glu Thr Pro Glu Gly val 
240 265 370 

<7TT TCCmCJkCOCTAJUAGTGlTATCATG ACT ATT CCA TCA TAT CTT 
Vol smr V«l ClA Ala Ly> Sor Vol Xla Mot Thr Xlo pro M Tyr Vol 
275 250 215 

OCTWCAACATTrWCXrrCCACTTTCAAOC OAT OCT OCA CAT OCT Ct A 

Ala Sor Aai) Xlo Uu Arg Pro UU Bar Sor A«p Ala Ala Aop Ala Um 

290 295 200 

TCA AGA 1TC TAT TAT CCA CCC ffrr OCT OCT Of A W CTT TCO TAT CCA 

Sar Arg Pha Tyr Tyr Pro Pro val Ala Ala Vol Ttur Val 8or Tyr Pro. 
305 310 315 



57 



334 



383 



430 



470 



524 



574 



422 



470 



711 



744 



814 



642 



910 



951 
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ty« «la kU Urn Keg Vy^ Clu Cys i<«a Xl« A«p Cly alu Lmi GXa Qly 
339 335 330 335 

Ptift Gly QlA IM His »ro Arg s«r Gla 61y Val Glu n&r Uu OXy Ibr 
340 34S 350 

AtAtJ^K^TCCTCACTCmOCAAATOOtfaCrCCrWaOTJWm 
tU Tyr 3«r 8«r s«r Sm Pfaa >ro Asn Am Ala Pr« A«p Cly Az« V*l 
3S3 3€0 3($ 

rrAcrrctAAAcTACArAOGAOffTCcrACAAj^AJCAOQAATrGrrrtcc 
Lm Uu Uu An lyr ll« Gly Gly AlA Ttar An Thr aiy ZIc V«I tar 
370 37$ 3tO 

AMACTOAAMrOftCCTOatceAAmmOAeOOfGACCTCeaAAAA 
tya tnr Olu ««r Olu LW V«l Glu AU V*! A«p Azv A^ Uu Akv ty» 
305 390 W 

ATOCTTAtAAMTCTAeAOakfimOKCCCfTrAmerfaorCtTOQA 
Mt IM ZU A«a fi«r 9hr AU VAl A«9 Px« V4l MU Oly V«l Arg 
400 40S 410 41S 

qnTOGccAcw^OQCATAmcAomcMorAmcArcrroATeTT 

VaI TKp m Qin Ma XI* PCP Gla Ptot Lva Vml Gly lU* Awp IM 
470 4aS 430 

m CJkA OCX OCA AAAOCTOCCmCJ^CCAOe7rCOCT>CaATCQBCtO 
Uu Clu AU AXa 14« Ma Ala LMi Aip Arv o;y 6iy Tyr Aa» Oly M 
439 440 445 

mctAOBJ^owAACTATGrrdCAoaAantxccTGaccjiCAtacotT 
PtM Umi Gly cly Aan Tyr Val Ma Cly Val Ma Xi*u Gly A^ Cyi Val 
450 4S5 4£0 

GMOOCOOOTATGAAAfftOCCTOOaUATATCTGIUrTTCTTSACCAM 
Olu Gly AU Tyr Glu S«r Ma S«r GlA U« S«r Aap ptM Uu t&j Lyt 
44$ 470 475 

TAT OCC TAC AAG TCAtQUAOA MOtOGWOC TACTTGTTAA TOG TTr A TGT 

Ty* Xla Tyr ty» 

400 

T0CATACA7G AOQfQCCTCC OOQOAAAU A AMCHGAAT AOtATmTT ATOTTATPr 
tCTAAATTGC ATTTCTCSTTC TTTmCTAT CAGTAATTAG TTATATmA GTTCTCTWB 
J^TTSTTCt GTTCACTPCC CTTQUUUM AATtTTATfr TtCATTpm TATGACAOCT 
OTOCTMTTA M A MAMA I A>AAMM 



12\ ItmiMATXON nm. SCO XD 110:4: 

a\ fti&QOiHc* oMAcroumcit: 

(A> UHOtM: 4*3 amino mcL6m 
4Bt TYI>S: aaUo acid 
<D) fOVQLQOri liAaar 

Ul> MOUCVU TYPet pMtaln 

.(ICIJ iBOUmrt MSClimtONi; 5CQ ZD II0;4} 



58 



1004 
10S4 

lloa 

1150 

1191 

1144 

1394 

1U2 

1390 

1434 

1490 

1550 
1410 
1470 
1490 




1 


Alt Asp cys 


Vftl V«l VaI 


Gly 


Gly 

10 


Gly 


xlo 


Sor Gly Lots Cyo 

15 


Thr aIa 


uj,n Axft Ipwu 
30 


AX* n3Cy 


His 
25 


civ 


v«l 


Cly 


Aop Vol Lou Vol 

30 




Al* AT^ A>A 

3S 


ftM« 

ACy «ro wi-y 
40 


u*y 




tlo 




Tftr VaI Glu Arg 
45 


Pro Olu 
50 


Giu cjy Tyr 


^■U TXP viu 

S5 




^»Ay 


Pro 


Am 

40 


5or Plio Gin Pro 




Vtf« Vml L«u 


Thr Mrtt. JIIa 
kmii nvv 


Vtol 


A«0 


d«r 
75 


Gly 


iM Lyo Aop Aop 

80 




VIm 

r«w *»*y *»P 
83 


rzw jwvn 




Artr 

1*11 

90 


Pht 


VaI 


IM Trp Glu Gly 
95 




jwn nrp 
100 


fi*^ TaM 

*or* 


105 


AlA 




Lou 


Pro Pha Ptaa AOD 

UO 


f .All liAfr- 


<iMr tl* 9m 
sm* **■ WW 

115 


civ l^U 
V-^y mJrm 

120 




AIs 


Glv 


iM 


Gly aIa Lou Gly 

125 


L30 


riv rxw rrv 


Dvy« nlv Xy>fl 
135 


Glu 


Glu 


dor 


Vol 
140 


Glu Glu Pho VaX 


145 


A»i] wxy 


X\m rtlu Uml 
AA# W-^U T«A 

150 




Gill 


155 


Lou 


tlA Glu Pro Hao 

^i^W 0*^0f A • v-m^^ 

160 


Cyv 5er 


Giy v«i Tyr 
155 


Ain Gj^y ASP 


Pro 


170 


Ly» 




tf^^ M*^ T\M JLlft 
0fiC^ •ijfm AAB 

175 




Giy i»yi v»i 

100 


Trp Wnj 


185 








fill/ C«v Tl« Ha 
w^y >>AV **v 

190 


GXy Cly 


Tttr Xl# Xtyi 
195 


i*fW># 

Tac Zi^ 

200 




Arg 






\.m9i Vt>a Lvm Vhfn 
Avn jmi mj9 r*v 

205 


2X0 


Ai^p AI«L At? 


IjbM rTO Mys 

215 




fciya 


Giy 


Gin 
230 


VaI XIa Ser 


225 


l*y« GXy i«u 


JU# Iwt uiU 

330 


Pro 


Affi 


uo 

335 


IXO 


'P'^^ fS^v Cmt* T^AKl 

TUf drB7 invu 

340 


Cly S«r 


Ly« v«l l»y» 
245 


uteu S«r Trp 


Ly* 


Lou 
250 




Ser 


lie TTir Ut^ ''^r 

355 


MP h»9 


Ly* Gly tyr 
340 


V«X Uu Glu 


Tyr 
245 


Glu 


Tlir 


Pro 


Glu Gly vol vol 
270 




GlA Al« 1^9 
275 


5«r VaX Zl« 
290 


N«t 


Thr 


llo 


Pro 


5er Tyr Vol Ala 
385 


s«r Asa 

390 


XX« AT? 


Pro L«u 5«r 

395 


5«r 


top 


Alo 


Aim 

300 


Asp AIa Lou Sot 


ATfl Wie 

305 


tyr Tyr Pro 


Px-o V*X Al« 
310 


AlA 


Vttl 


Thr 
315 


Vol 


Sor Tyr Pro Lys 

320 




Il« Ar9 

325 


Giu cy« i^u 


XX« 


A«P 

330 


Gly 


Glu 


Lou Gin Gly Phc 

335 


aXy Cln 


Leu Hit Pro 


Arg s«r Gla 


Gly 


VAl 


Glu 


Thr 


Lau Gly *tbx Zla 
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340 



34S 



3S0 



tyif Ser ^ scr Leu Fhe Pre JUn Ara aIa Pro A«p Gly Arg Vftl t«u 

35S 360 36S 

Leu L«u Mn fyr lie Gly oly Ala Thr Aan Thr Gly xi« V«l Mr Ly* 

370 375 380 

fhr Glu smr GIu Lmi V«l Glu Al« Val A«p Ar9 A«p L«u Arg Lyi H«t 

385 390 395 400 

Lmi tl« A«n s*r Thr Al« V^l A«p Pro Leu VaI Leu Gly V«I Arg VaI 
405 410 415 

Trp Pre ala aIa Xl& Pro CXa Phe Z#«u VaI Gly Hi« Lma Amp L«u L«u 
420 43$ 43C 

Gltt Ala Ala Ly« Ala Ala ttm Asp Arg 61y Gly TVr Asp Gly Leu Fhe 
435 440 445 

Lau Gly Gly A«n Tyr Val Ala dy Val Ala Lau Gly Arg Cya Val Glu 
450 455 460 

Gly Ala Tyr Glu S«r Ala Ser Gin lie Stf Aep Aw Leu tbr Lya fyr 
46S 470 475 400 

Ala Tyr Lya 



(2) INFORKATION rOR 5&Q tO NO:7: 

(1) 5SQ0£NC£ CHARACTERISTICS; 

(A) LnfGtH: 206: base palra 

(B) TYPE: nucleic acid 
iC) 5TRANDeD»IES5: fllndXa 
roi TOPOLOGY: linear 

(ii) KOLBCOLE TYPE: CDHA 

(iii) fOTOTKETICAL: 1» 

(iv) AKTI-SCMSC: HO 

(iX) FEMURE: 

(A) KAMS/XSY: COS 

(B) LCCATZON: 64.. 1698 

(D> OTHER INPORMATZOlf: /not^' 'Haise CENA; 
aequenea fron plffiC-S* 

Ixi) SEQUQiCe OESCRZPTZCNs SCO XP MD:7; 

CTCTCCTACC TCCACCTCCA CGACAACAAG CAAATCCCCA TCCAGTOCA AACCCTAACT 

CAA ATGCTCOCTTTGJUTTCCCTCAOCCTCATCCGCT TCC TCC CAT CCT 
M«t Leu Ala Leu Thr Ala Ser Ala $er Ser Ala Scr Ser Hlv Pro 

15 10 IS 

TATCaCCACGCCTCCGCCCACACrCCTCCCCCCCXCTACGTOCGCtC 
Tyr Arg Hla Ala 5«r Ala uia Thr Arg Arg Pro Arg Leu Arg Ala Val 

20 25 30 



60 
109 
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m 




Mm 





mi 



CTC <SCG ATG GCG CCC TCC GAC GAC CCC COT OCA GCC CCC OCC AGM ICC 204 
L«U AlA MC Ala Gly S«r Aap A«p Pro Arg Aim A^a Pro Al* A^ff 
35 40 45 

GTC CCC CTC CTCOGCGCCGGCCTCAGCOOecrCGCC OCC CCG TAC AGO 253 
val A3.A y^X VAl Gly Ala Oly V4X 5«r GXy L«u Ala Ala AU Tyr Arg 
SO SS 60 

CTCAGACAGAGCOGCCTCAACGTAACCCTGrtCGAACCGGCCGACAOG 30D 
L«u Arg Gip S«r Cly Val Aan Val Thr Val Fh« Glu Ala Ala Aop Arg 

65 10 7S 

GCG GGA GOl AAG ATA COG ACT AAT TCC GAG QGC OGG TTT GTC 1QS GAT 340 
Ala Gly Gly tyc Zl« Arc Thr A«r. sor Glu cly cly Pbc v«l Tjrp A«p 
BO 05 90 95 

caAAOGAGCTAACACCAtGACAGAAGCrGUTOGGAGOCCACTAGACTG 396 
Glu Cly Ala Aon Thr Mac Thr Glu Cly Glu Tip clu Ala Sar Arg im 

100 lOS 110 

ATT GAT SAT CTT QGT CTA CAA GAC AAA CAG CAG TAT CCT AAC TCC CAA 444 
lie Aap A«p Lau Gly Leu Gin Asp Lya Gin Glu Tyr Pro Aon Sar Gin 

lis 120 135 

CAC AAG CCrr TAC A17 GTC AAA GAT GGA GCA CCA CCA CTG ATT CCT TCO 493 
KiB Lys Arg Tyr Xl« VkI Lya Asp Gly Ala Pro Ala L«u Il« Pro Sor 

no 13S 140 

GAT CCC ATT 1>06 CTA ATG AAA AGC ACT CTT CTT TCC ACA AAA TCA AAG 340 

Asp Pro lie S«r Leu Met Lys Scr ser val Leu S«r Thr Ly* S«r Lya 

14S 150 iss 

ATT GCC TTA TTT TTT CAA CCA TTT CTC TAC AAG AAA GOT AAC ACA AGA SOB 
Il« Ala Leu Phe Fh« Glu Pro Phe Leu Tyr Lya Lya Ala Aon Thr Arg 

160 16S 170 175 

AAC TCT GGA AAA CTG TCT GAG GAC CAC TTG ACT GAC ACT GTT OGG ACC 636 
Aan ser Gly Lys Val Sar Glu Glu Ki8 Lau Scr Glu S^r Val Gly Sar 
180 165 IdO 

TTC TCT GAA CGC CAC TTT GGA AGA CAA GTT GTT GAC TAT TTT GTT GAT 664 
phe cys Clu Arg His Phe Gly Arg Glu Val Val Aop lyr Ptie Val Aop 
195 200 205 

CCA TTT GTA GCT GGA ACA ACT OCA GCA GAT CCA G*C TCA CTA TCT ATT 732 

Pro Ph« Val Ala Gly Thr Ser Ala Gly Aap Pro clu Ser Lou Scr lie 
210 215 220 

CCT CAT GCA TTC CCA GCA TTG TOC AAT TTG GAA AGA AAC TAT GCT TCA 7B0 

Arg His Ala Phc Pro Ala L«u Trp Aan Leu Gl-x Arg Lya Tyr Gly S«r 

225 230 235 

CTT ATT GTT GCnr CCC ATC TTC TCT AAC CTA CCA GCT AAA GOT GAT CCA 820 

val lit v«ii Gly Ala Xlc Leu Ser Lys Leu Ala Ala Lya cly asp Pro 
240 245 35Q 255 

CTA AAC ACA AGA CAT GAT TCA TCA OOC AAA ACA AOC AAT ACA CCA CTG 876 
Val Lys Thr Arg Hi9 A&p Ser sar Gly Lya Arg Arg Aan Arg Arg Val 
260 265 270 

TCn TTT TCA TTT CAT GCT GGA ATC CAC TCA CTA ATA AAT CCA CTT CAC 
Scj Phe Sar Phe His Cly Cly Mac cln Ser Leu He Asn Ala Leu Hxi 
27S 260 285 
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AAT GAA err CX2A GAT GAT MT GTC AM3 C1T OCTT ACA OAA OTO TTS tCA 
Aan Glu Val Gly A»p A4p A«n Vol Lya Gly Thr Olv V«l L«u 9eif 
390 395 300 

TOG»TGTACAmGATOCAGW<XTGCACTA6GCAC5CTOCTCAATT 
L«U AlA Cyi Dir tlie A«p Gly V«l rrr aIa Gly Arg tfp S#r tim 

305 310 315 

TCT cn GAT TCG AAG GAT AGCGCTaACAAGGACCTTOCTAGtXACCAA 
341: VAl AMP Sttr liys A«p Sar Gly A«p Ly» A«p AIa 5«r A«n OlA 

330 32S 330 335 

ACC TPr GAT CCT CTTT ATA ATG ACA OCT CCA TTO TCA AAt GTC CGC AGG 
Thr Phm Xmp Ala Val lie Mat Thr Ala Pro Set Aan Val Arg Arg 
340 345 350 

ATO AAG TTC ACC AAA GGT GGA GCT CCG CTT GTt ClT GAC TTT CTT OCT 
Mat tya Phe Thr Lya Gly Gly Ala Pra Val val L«u A«p Pha Lvu pro 
3S5 360 3tiS 

AAG ATG GAT TAT CTA CCA CTA TCT CTC ATG GTG ACT OCT TTT AAC AAG 
Lys Hat Aap Tyr Lau Pro Leu S«r Lau Hat Val Thr Ala Pha l#ya Lya 

370 375 380 

GAT GAT CTC AAS AAA CCT CTC? GAA OOA TTT GGG GtC TTA ATA CCT TAC 
Aap Aap Val tya Lya Pro Lau Glu Gly Pha Gly Val Lau Xla Pro Tyr 
3BS 390 395 

AAG GAA CAGCAAAAACATGCTTCtGAAAACCCrr GOG ACT CTC TTT TCC 
Lya Glu Gin Gin Lyj Hia Gly Leu L/a Thr Lau Gly Thr Lau 9hm Ser 
400 405 410 415 

TCA AT5 ATG TTC CCA GAT CGA CCT CCT GAT CAC CAA TAT TTA TAT ACA 
sar Hat nee Phe Pro Aap Arg Ala Pro A^ Aap Gin Tyr L£'j Tyr 3far 
420 425 430 

ACA TTT GTT G!5G GGT AfiC CAC AAT AGA GAT CTT OCT GGA GCT CCA ACG 
Thr Phe Val Gly GXy Scr Hia AaA Arg Aap Leu Ala Gly Ala Pre Thr 
435 440 44S 

ret ATT CTG AAA CAA CTT CTC ACC TCT GAC CTT AAA AAA CTC TTC OCC 
s«r lla Lau Lys Gin Leu Val Thr Ser Aap Lau Lya Lya Lau Lau Gly 
4S0 4SS 4eo 

CTA GA:^ GGG CAA CCA ACT TTT GlC AAG CAT CTA TAC TOG GGA AAT GCT 
Val Glu Cly Gin Pro Thr Phe Val Lya Hit Val Tyr Trp Gly Aan Ala 
465 470 475 

TTT CCT rrC TAT OGC CAT GAT TAT ACT TCT CTA TTC GXA CCT ATA GAA 
Phe Pro Leu tyr Gly Hi« Aap Tyr Ser Ser Val Lau Glu Ala Xla Glu 
400 495 490 4»S 

AAf? ATG GAG AAA AAC CTT CCA GOG TTC TTC TAC OCA GGA AAT AGC AAG 
Lys Mat Glu Lya A»n Lau Pro Gly Pha Phe tyr A].a Qly Aan 5«r Lya 
500 505 510 

CAT GGG CTT CCT CTT GGA ACT GtT ATA CCT TCA OOA AGC AAC OCT CCT 
Ajp Gly Leu Al9 Val Gly Ser Val Xla Ala Ser Gly 5ar Lya Ala Ala 

515 sao 535 

CAC crt GCa ATC TCA TAT CTT GAA TCT CAC ACC AAG CAT AAT AAT TCA 
Aap Lau Ala He 5er tyr Lau Glu Sar His THr Lya Ula Aan An -Ser 
530 535 S40 



62 



973 



1030 



10«8 



11 £4 



12X2 



13 OB 



135* 



1404 



1452 



1500 



154a 



1596 



1644 



1692 



nm\iU^)n^m 1:B S K B 



R:6 2 4 P. 67 



CAT TQMUWSWrC TGACCTXTCC TCTAflCAJTT OTCOACAMT TfCTCCMrT 174S 
Him 

545 

CAICTACAGT AGMUICClUT OaTfTOCACT TTCAOMCAT CWCACCTCT tCMATATTA 180S 

iMCCCTTGCrr GMCATCCAC CAOMAOOTA OrrC^tVTC TAACTOOOAA AATGAflOTTA 1««5 

AAAACTATTA TOOOO CC OttA AATCTTCCTT ' iVlW rrT CC TOyCM^TTOG CCtACGACAC 1925 

TTCATGTPOC AAATACATTT AAATITGTTG AATTOTTTCA OAACACATOC CtOACOTCTA 1965 

ATATTTOCCT ATTCnCATTT TA6CACTACT CTTfXiCCAaA nATOCTTTA CCCCmXAA 2045 

XAAAAAAAAA AAAAAA 20^1 




(2) ZNTORMATZON rOR SEQ ZD »0.*a: 

(i) SCQOBICS CttAJUlCTERXCTICS: 

lA) LSNQTB; 544 Mine mcidm 
(B> Ty?B: AHittO Add 
(D) TOtOLOOY: linMlT 

Hi) HOIiBCULE TTFR: protsln 

(Xi) S£OC£NC£ DESCRIFTlON: SSQ ID NO: 8: 

M«c i.«u Ala ZM Thr AI« 5«r Ala Sar S«r Al« ^ Smr flU Tyr 

15 10 15 

Ar0 His AlA Ser AX« His Thr Arg Arff Pro atv x«w Arg Al« vaI 1m 

20 25 30 

Ms Mec Als Gly S«r Asp Asp Pro Arg Ala AU Pro Als Arg Val 
3$ 40 45 

Als vsl vsl Gly Als Gly Vsl sor Gly L«w Als Als Aim tyr Arg Lsu 

SO 55 «0 

Arg Gin Smr Gly val Asn vsl Thr Vsl Pn« Giu Als Als Asp Arg Als 

fiS 70 75 80 

Gly cly Lym lis Arg 7ti« Asn 5sr Glu Gly Gly Pba vsl Trp asp Glu 
AS 90 95 

Cly Als Asn Thr M«c Thr Glu Gly Glu Trp Glu Als Scr Arg Iau Z1« 
100 105 110 

Asp Asp L«u Cly L«u ciA Asp Lys Gin Cla Tyr Pro ASfi S«r cla His 

lis 120 125 

Lys Arg Tyr Zlc Vsl Lys Aap Gly Als Pro Als Z^u Zlc Pro Smr Asp 
130 las 140 

Prr tie Ser t^u itet Lys s«r Ssr Vsl Lsu Sftr Tfir Lys S«r Ly« Zl« 
145 150 155 150 

Als Leu Phe PiM Glu Pro Ph« L«u Tyr tyv Lys Als Asn Tlir Axv An 
16S 170 175 

ssr Gly Lys vsl s«r Glu clu His Lsu Ssr Glu Ssr Vsl Gly Ssr rtm 
ISO 165 190 
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Glu 


An 

195 


His 


Ph« 


Cly 


Arg Glu 
200 


Vel v«l 


Aep Tyr Phe 

205 


Vel Aep Pra 




V«l 


Al« 


Gly 


Thr 


Mr 


All Gly 

31S 


Aep Pro 


Glu ser Xieu 

220 


Ser Xle Arg 


Ri> 


AU 




Pro 


Al» 


330 


Trp Am 


Leu Glu 


Arg Lyf Tyr 

235 


Gly Ser Val 
340 


Zl« 


vml 


Gly 


Al* 


lU 
345 




Ser tya 


Leu Ale 

250 


Ala Lye Cly 


Aep Pro Vel 

255 




Thr 


Aifg 


H4« 

260 


AMP 


5«r 


S^t Gly 


Lyp Ar« 

365 


Arg Aen Arg 


Arg Val Ser 

370 




Smr 


PlM 

275 


El* 


Gly 


oly 


Met Gin 
200 


Ser Leu 


lie Ant Ala 


Leu Kle Aen 


Glu 


V«l 

290 


GXy 


A»p 


A»p 


Am 


VaI Lye 

295 


Leu Gly 


Thr Glu Vel 

300 


Leu Ser Leu 


Ala 


cy« 


Thr 


ph« 


A«P 


Gly 
310 


Vel Pro 


Ala Leu 


Gly Arg txp 
315 


Ser Xle Scr 

320 


VmX 


A«P 


5«r 


Lyt 


325 


S«r 


Gly Aep 


Lye Aep 

330 


Leu Ale Ser 


Aen Gin Thr 

335 


Phft 


A*P 


A2« 


Val 
340 


lie 


Met 


Thr Ale 


Pro Leu 
345 


Ser Aen Val 


Arg Arg Met 

350 


Ly. 


?h« 


Thr 
35S 


Ly* 


Gly 


Gly 


Ale Pro 

360 


Val Vel 


Leu Aep phe 

365 


Leu Pro Lye 


Httt 


Ainp 

370 


Tyr 


L«U 


Pro 




Ser Leu 

37S 


Met Vel 


Thr Ale Phe 
380 


Lye Lye Aep 


ASP 

365 


vca 


Lys 


x-y* 


Pro 


Leu 
390 


Glu Gly 


Phe Gly 


Val Leu lie 

395 


Pro Tyr Lye 
400 


Glu 


Gin 


Gin 


Lyo 


Hia 
405 


Cly 


Leu lore 


xnr Leu 

410 


Gly Thr Leu 


»M S«r ser 
415 


N«C 






Pro 
420 


A«p 


Arg 


Ale ?r£i 


Aep Aep 

i2S 


Cln Tyr Leu 


Tyr ^tr Thr 

430 


Phe 


v*l 


Gly 
435 


Cly 




His 


Aen Aro 
440 


Aep Leu 


Ala Gly Ala 
445 


Pro Thr Ser 


He 


450 


Ly» 


Gin 


Leu 


val 


Thr Scr 
455 


Aep Leu 


Lye Lyo Leu 
460 


Leu Gly val 


Glu 
4«5 


Cly 


Gin 


P«> 


Ttur 


Phe 
470 


Vel Lye 


Hie Val 


Tyr Trp Gly 

475 


Aen Ala Ph« 
4B0 


Pro 


L«U 


Tyr 


Gly 


Hi« 
465 


A*P 


Tyr ser 


ser Val 
490 


Leu Glu Ale 


Xle Glu Lye 
49S 


K*t 


Glu 




500 




Pro 


Gly Phe 


Phe Tyr 
505 


Ala Gly Aen 


ser Lye Aep 

510 


Cly 


L«U 


AU 

SIS 


VaI 


Gly 


Ser 


val lie 
520 


Ala £«r 


Gly Ser Lye 

52S 


AlA Ala Aep 


Lttu 


AlA 


11* 


S«r 


Tyr 


Leu 


Glu ser 


Hie Thr 


Lye Hie Aen 


Aen ser Hie 
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530 



535 



540 



(2) INTORMATIOK TOR SEQ 11> H0:9: 

(1) SSQUBNCE CHARACTKAISTZCS: 

(A) LOStGVH: 1811 h»fla pair* 

(B) TYPE: nuclAic Acid 

(C) STSIANDEQNSSS: liJigle 
(P) TOFOLOCY: linMr 

Ui) KOI-BCnLE nrPE: CDKJk 
(ill) RYMnfETXCAL: HO 



<X) MMfE/W: CDS 

(B1 LOCATION; 3.. 1589 

(Df OTHER iwgiuarzoff: 



/produce* *«hMC pr0CO>c-l cDM" 





(Xi) DESCRZmOM: ID ISO.^t 

OC OCA ACA ATS <3CC ACC OCC ACC «TC 0C«5 CCC OCC TC© CCC CTC COC 4T 
Ala Thr AlA Thr aIa Thr V«l Ala Ala Al* Smr pro 1mm Arg 
15 10 15 

OOC ACS GTC ACC OOG OOC CCA CAC CGC GTC CGC COG CGT TOC OCT ACC 
Cly Arfl VAl Thr Gly Arg ?r6 Hi* Arg Val Arg Pro Arg Cy« AI4 Thr 
20 25 30 

GCG AOC AGC CCq ACC GAG ACT CCC OCG CCC CCC QOC GTQ COS CTG TCC 143 
Ala s«r Ser Ala Thr Glu Thr Pro Ala Ala Pro Gly V»l Arg IM smv 
35 10 45 

GCC GAA TGC GTC ATT GTO GOC CCC OGC ATC AGO CGC CtC TGC ACC GCG 191 
Ala Glu Cys val Xltt Val Gly Ala Gly Zle S«r Gly Cya Thr Ala 
SO 55 «0 

CAG OCG CTG GCC ACC CGA TAC OGC GTC AOC OAC CTC CTC GTC ACG GAG 239 
Gin Ala Lav Ala Thr Arg Tyr Gly Val $«r Aap L«u L«u Va?. Thr Glu 
€5 70 75 

GCC CGC GAC CGC CCC GGC GOC AAC ATC ACC ACC GTC GAG CGT CCC GAC 287 
;kla Arg Aap Arg Pro Gly Gly Aan lie Thr Thr Val Glu Arg Pro Aap 
60 85 90 95 

GAG GGG TAC CTG TGC GAG GAG CGA CCC AAC AGC TTC CAC CCC TCC GAC 33S 
Glu Gly Tyr Leu Trp Glu Glu Gly Pro Aan S«r Phc dirt Pro Ser AJp 
100 lOS 110 

CCC GTC CTC ACC ATC GCC CTG GAC AGC GGG CTC AAC GAT GAC TTC CTG 3183 
Pro VAl 'Lau Thr Mac Ala Val Aap Sar Gly hmx Lya Aap Asp Lau val 
115 120 125 

TtC GOG CAC CCC AAC CCC CCC COG TTC Gt^ CTG TQS GAG GOG AAG CTG 431 
Phe Cly Aap Pro A«n Ala Pro Arg Phe Val Leu Trp Glu Gly Ly* Leu 
130 135 140 

AGO CCC CTG CCC TCG AAG CCA CGC GAC CtG CCT TIC TTC ACC CTC ATG 479 
Arg Pro val Pro Ser Lya Pro Gly Aap Lau Pro Pba Phe Ser Leu Met 
Its 150 ISS 

65 
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J^CTATCCCTCaOCAACCTCAOCOCCXXXrCTrOGCOOScrcOCCATTCOC 
Sttr Xl« Pro Gly Ly* L«u ^rg Ala ,Gly Gly Al« U«u Oly tl« Arg 
160 155 170 X75 

eCACCTCCTCCAGCWCGCGASGACTCCOroCACGACW 
ETo Pro Pro Pro Gly Ax^ Glu Clu Smr VaI Glu Glu Pho V*l Arg Arg 
180 ISS 1^0 

AAC etc GOT CCC GAG CTC TTT GAG CCC CTC ATC GAG CCt TTC TGC TCA 
hMxx Lttu Gly Al« Glu V»l Ph« Glu Arg L«u Xl* Glv Pro Pha cy 3«r 
195 200 205 

OCT OTA TAT GCT tSST GAT CCT TCO A*6 CTT ACT ATG AAS CCt OCA TTT 
Gly VAl TVr Alt Gly A«p Pro Bmr Lyi L«u S«r Hoc U^b Ala All Fh* 
210 315 230 

OOO AAG GTC ITO AGG TtG GJM3 GAG ATT GGA GGT ACT ATt ATT OGT GGA 
Gly Lys v*l Trp Arg L«u Glu Glu ll* Gly Gly S«r Xla XI« Gly Gly 
235 330 235 

ACCATCAAGGCGATTCAGGATAAAGGGA^AACCCCAAACCCCCAAGG 
Thr lie Lys Al« zl« Gin A#p tym Gly Lym Asn Pro Lya Pro Pro Arg 
340 345 2S0 355 

GATCCCCGACTrCCGGCACCAAAGGGACAGACGGTGGCATCTnCAGG 
Avp Pro Arg Lftu Pro Al« Pro Lys Gly Gin ttir vaI Ala S«r Phe Arg 
360 265 270 

AAC OCT CTA GCC AtG CTC CCG AAT GCC ATC GCA TCT ACG CTG GGT ACT 

Ly« Gly L«u Ala M«e Lau Pro Ant AlA lie Ala Scr Arg teu Gly Scr 
275 280 265 

AAA GTC AAG CTG TCA TOG AAG CTT ACG ACC ATT ACA AAC OCG GAC AAC 
Lye Val Ly« L«u Sax Trp Lya Leu Thr Set 11^ Thr Lya Ala Asp Am 
290 295 300 

CAA GGA TAT GTA TTA GGT TAT GAA ACA CCA GAA OGA CTT CTT TCA GTG 
Gin Gly Tyr Val Lau Gly Tyr CIu Thr Pro Glu Gly Val Sar Val 
305 310 315 

CAC CCT AAA ACTT GTt ATC ATC ACC ATC OCG TCA TAT CTT GCr ACT GAT 
Gin Ala Lys Sar Val Ila Mat Thr lie Pro S«r Tyr Val Ala 5«r Aep 

320 325 330 335 

ATCrrcCGCCCACTrTCAATTGATCCAGCACATCCACTCTCAAAArtC 
II* Leu Arg Pro Leu Ser lie Asp Ala Ala Asp aIb Leu Ser Lya Pha 
340 345 350 

TAT TAT CCG CCA GTT CCT CCT GTA ACT CTT TCA TAT CCA AAA GAA GCT 
Tyr Tyr Pro Pro V«l Ala Ala Val Thr Val Sar Tyr Pro Lys Glu Ala 
3SS 360 365 

ATT ACA AAA GAA TGC TTA ATT GAT OGG GAG CTC CAG CCT TTC GCC CAC 

Il« Arg 4iys Glu Cys Leu Ila Asp Gly Glu Lau Gin Gly Phe Gly Gin 
370 375 380 

TTG CAT CCA CCT AGC CAA GGA GTC GAG ACT TTA GOG ACA ATA TAT AGC 

Lau Hit Pro Arg Sar Gin Gly Val Glu Thr Lau Gly Ttir lla Tyr Sar 
385 390 395 

TCT TCT CTC TTT CCT AAT CCT GCT CCT CCT GGA AGA CTO TTA CTT CTG 
Ser sar Lau Pha Pro Asn Arg Ala Pro Ala Gly Arg Val Leu Leu Leu 
400 . 405 410 415 

66 



527 



S7S 



623 



671 



719 



767 



015 



863 



911 



959 



1007 



1055 



1103 



1151 



1199 



1247 
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AAC Tht ATC GGO GGT TCT ACA AAT ACA OOG ATC QIC TCC AAC ACT GAC 
Aki Tvr ll€ Gly Gly S«r Thr Aan Thr Cly !!• VaI Mr Ly« Tnr GlU 
420 42$ 430 

ACT GAC TTA GTA GGA CCC GTT GAC CGT QAC CTC AOA AAA AT6 nt» ATA 
Sttr Asp L«u VaI Gly AIa V9I asp Arg Asp L«u Arg Ly« H«C L«u Il« 
43S 440 44S 

AAC OCT ASA CCA GCA GAC OCT TTA OCA TTA OGO CtT CGA GTG TOG CCA 
ASA Pro Arg AIa Ala Aap Pro L«u Ala Lau Gly Vsl Arg Val Trp 9ro 
4S0 4S5 460 

CAA GCA ATA CCA GAG TTT TTG AlT OGG CAC CTT CAT CGC CTT OCT OCT 
Gin Ala Il« Pro Gin Phe Leu lie Gly Hifl L«u Asp Arg Uu AU AI« 
46S 470 475 

OCA AAA TCT GCA CTC 60C CAA GOC GGC TAC GAC GOG TTC TTC CTA GGA 
Ala Lys Ala Lsu Gly Gin Gly Gly Tyr Asp Gly l*ou Phs Lsu Gly 
480 4Sb 490 495 

GGA AAC TAC OTC GCA GGA GT? OCC TTG OGC CGA ATC QMS GOT OCG 

Gly Ami Tyr Val Ala Gly v«l a1« Leu Gly Arg Cys lie Glu Gly Als 
5O0 SOS SlO 

TAC GAG ACT OCC TCA CAA CTA TCT GAC TTC TTG ACC AAG TAT CCC TAC 
Tyr Glu Ser Als ser Gin Vsl Ser Asp Phs Lsu Thr Lys Tyr Als Tyr 

SIS 520 535 

AAG TOA TOGAAGTAGT GCATCTCTTC ATrTTGITGC ATATACGAGG TGAGGCTAOG 

tys 

ATCOGTAAAA CATCATGAGA TTCTCTACTC TTTCTTTAAT TGAAAAAACA AATTTTACTC 
ATOCAATATC TGCTCTTTCC TGTAGTTOGA GCATGTACAT CGGTATGGGA TAAM»AGAA 
TAAGCTATTC TGCAAAAOCA GTGATTTTTT ttoaaaaaaa aaaaaaaaaa aa 



129S 

1343 

1391 

1439 

1407 

1S3S 

1583 

1639 

1499 
1759 
1811 



(2> ZNTORMATIOM FOft SCQ ID NO: 10: 

<i» SBQOGtlCE CHAAACTStZSTICS: 

(A) La«GTH: S28 Amino sci4s 

(B) nrPE; amino acid 
iD) TOPOLOGV: linear 

Hi) MOLECULE TYPE: procein 

(XL) SE0UO7CE DCSCHIFTION: SEQ ID NO:10t 

Ala Thr Hac AIa Thr Ala Thr Val Ala Ala Ala Ser Pro L«u Arg Gly 
IS 10 15 

Arg Val Thr Gly Arg Pro His Arg Val Arg Pro Arg Cys Ala Thr Ala 

20 25 30 

5ar sar Ala Thr Glu Thr Pro Ala Aia Pro Gly Val Arg Leu 5cr Als 
35 40 45 

Glu cys val lie val Gly Ala Gly zle Ser Gly Leu cys Thr Ala Gin 

SO «S 40 - 
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70 



75 



90 



.-a A»p Arc Pro Cly cly Am Il« Thr Thr v«l clu Arg Pt& Amp olu 
8S 90 9S 

GXy Tyr L«u rrp alu Clu cly ?ro Am Ser P}i« Gin Pro s«r A#p Pro 
100 105 110 

V»l Leu Thr K«t Ala Val A«p S«r Gly Lm ty> Asp X^p Val Phm 

lis 120 125 

Gly Asp Pro Kmti Ala Krg PHe val L«u Trp Clu Gly Lys b«u Ar? 

130 135 140 

Pro val Pro ser Ly« pro cly xwp Uu Pro Phe Phtf S«r LafU Hat &«r 
.145 150 15S ISO 

Ila pro Gly Lya Lcu Arg Ala Cly Lau Cly Al« Leu Cly ZX« Ary Pro 

16S 170 175 

Pro Pro Pro Gly Arg Clu clu 5«r Val Glu Clu Pht Val Arg Arg Aan 

190 195 190 

Leu Cly Ma Olu Val Phe Clu Arg Uu Xla Clu Pro Pha Cya dcr Cly 
IPS 300 305 

val ryt aIa cly Aap Pro sar Lya Lau 5«r Hmt Lya Ala Ala Ph« Gly 

210 215 35' 

Lya val Trp Arg L«u clu Clu lie Cly Cly Ser Xle Xle Gly Gly Thr 
23S 230 335 340 

Xla Lya Ala Zle Cln Asp Lya Cly Lya Aan Pro Lya Pro Pro Azg Aap 
34$ 350 355 

pro Arg Leu Pro Ala pro Lys Cly Gin Thr Val Ala Ser Phe Mrg Ly* 

260 3€5 370 

Gly Lau Ala Mat Laa Pro Aan Ala Zla Ala Ser Arg Leu Cly SeT Lya 

375 360 385 

val Lya Leu sar Trp Lya Leu Thr Ser lie Tbr Lys Ala Aap Aan Gin 

390 395 300 

Gly tyr Val Leu Cly Tyr Clu Thr Pro Glu Gly Uu Val Ser Val Gin 

305 310 315 330 

Ala Lya Ser Val He Met Thr Zle Pro Ser Tyr Val Ala Sar AaP Zle 

325 330 335 

Leu Arg pro Leu Ser Xle Aap Ala Ala Aap Ala Leu Ser LyO Phe Tyr 
340 345 350 

Tyr pro pro val Al« aIa Val Thr Val fiar Tyr Pro Lya Glu Ala lie 
355 360 365 

Ar9 Lya Clu Cya Leu He Aap Gly Glu Leu Oln Gly Phe Qly Gin Leu 

370 375 390 

Hia Pro Arg Ser Gin Gly val Glu Thr Leu Cly Thr Zle Tyr Scr Ser 
365 390 395 400 

Ser Leu Phe Pro Aan Arg Ala Pro Ala Cly Arg VaI Leu Leu Leu Aan 
40S 410 4X5 



5 
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Xry» ttir Glu S«r 
430 

MAC L«U 2X« AMI 
44$ 

Val Txp Pro Gin 



L«u AlA Mm AlA 

phft Lttu Gly Gly 
495 

Glu Gly Ala Tyt 
SIO 

Tyr Al« Tyr Ly» 

525 



*ryr lit Gly Gly S«r Thr Ain Thr Gly He v»l s«r 

420 

Asp L«U Vftl Gly Ala V«l K*P Arfl Asp L«u A*g Ly« 

435 440 

Fro Arc Alt Ala A«p Pro Leu Al4 l^u Gly Val Arg 
450 455 460 

AU Il« rro GlfJ Ph« L«i lie Gly Hie l«u Ajp Arg 
465 470 475 

Wfl Smv Ala Gly Gin Gly Gly Tyr Aap Cly 1^ 
485 490 

Aan Tyr val Ala Gly Vai Ala Lcu Gly Arg Cym lla 

SCO SOS 

Glu S«r Ala Sar Gin val fiar Aap Ph« Thr Lyv 

515 B20 



(21 ZNTOIIIIAtZON FOR SBO ZD 190$ lis 

(A I WWCTM: 1847 baac pain 

(B) TYPE: nucloiC acid 

(C) STRM03£ZXf£SS: single 

(D) TOPOIOCT! linaar 

(ii) KOLBCULE TYPE: CIWA 
(iiil mrPOTHETlCAL: WO 



rA> rOHE/KEy: cos 

ID) LOCATIONS 55.. 1663 

(D) OTHER mmwiATlON: /product* "aoyb^an prot03(-l c£HA" 

{Xi) SEQUEl^E PESCRimON: SEQ ID 110:11: 
CTTTAOCACA GTCTTCAACA TAACGAACGX ATACTOCCAT TACTOTAACC AACC ATC 

1 



57 



CTT 
V«l 


TCC 

Ser 


GTC 

y^x 


TTC 
Pha 

5. 


AAC 
Aan 


GAG ATC 
Glu Zla 


CTA TTC CCG 
L-au Ph* Pro 
10 


CCC AAC CAA 
Pro Aan Gin 


ACC 
Thr 
15 


CTT CTT 
Lau Lau 


105 


CGC 
Arg 


ccc 

Pro 


TCC 
sar 

)0 


CTC 
Lau 


CAT 
(Ua 


TCC CCA 

s«r pro 


ACC TCT TTC 
Tttr sar Pha 
25 


TTC ACC TCT CCC ACT CGA 
Pha Thr Sar Pro Thr Arg 


15} 


AAA 


TTC 
Phe 
35 


ccr CGC TCT 
pro Arg Sar 


COC CCT 
Arg Pro 
40 


AAC CCT ATT 
hMTi Pro Zla 


CTA CGC TOC 
Lau Arg CVa 
45 


TCC 
sar 


ATT OOt 
Zla Ala 


201 


GAG 
Glu 
50 


GAA 


TCC 
Sar 


ACC 
Thr 


GCG 
AJ.a 


T^T CCG 
Sar Pro 

55 


CCC AAA ACC 
Pro Lys Thr 


AGA GAC TOC 
Arg Asp Sar 
60 


GCC 
Ala 


CCC GTG 
Pro val 
€5 


349 



r 



I 

P 
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qAC TCaC GTC OK GTC GCC GGK QOC OTQ MOC OOC CTC TQC ATC OCC 00 297 
p Cy* VAX Val V^l Gly dy 61y V«l 8«r Gly l«u Cys n« Al« Gift 
70 75 90 

GCrCTCOCCACCAJUCJ^OCCMTOCCAACCTrCCnCGTCACGGM 345 
Ala Zicu AlA Thr Lya Kif aIa A*n Aid A»n vaI Vsl vel mr Clu Ala 
85 90 95 

CCA OAC CC3C GTC OOC GCC AAC ATC ACG ACG ATC GAG AGG GAC GGA TAC 39) 
Ara Asp Xr? Val Gly Gly A«n lit Thr Ttu- Met Glu Arg A«p Gly Tyr 
100 105 110 



V >^ 



CTC TGG GAA GAA GOC CCC AAC AGC TTC CAG CCT TCT GAT CCA ATG CTC 441 
l^u Trp Glu Glu Gly Pre Asn Ser Pha Gin {^o Smt XMp Pro Mt Lmx 
IIS 120 125 

ACC ATG GPG GTO GAC AGT GGT TTA AAG GAT GAG ClT OTT TIC QGG GAT 469 
Thr MAC VaI v«1 Aflp smT Gly Lou ty* A«p Glu Vmx VaI lcu Gly A«p 
130 135 140 145 

CCT GAT GCA CCT CGC TTT GTG TTC TGG AAC AGG AAC TfC AGG CCD <7tG 537 
Pro Asp Ala pro Arg Ptom Val 1m Trp Asn Arg Lyt Lau Arg Pro Val 
150 155 160 



1 



CCC OOG AAG CTC ACT GAT TTG CCT TTC TTT GAC TlG ATG AGC ATT OCT 5«S 
Pro Gly Lya ii«u Thr A«p Lau Pro Pho Pbm Asp Lau Nat Smr ila cly 
165 170 175 

GOC AAA ATC AGG OCT GOC TTT GOT GCG CTT OGA ATT OOG CCT CCT CCT €33 
Gly lym lit Arq Ala Gly Phe Gly Ala tau Gly Xle Arg Pro Pro Pro 
180 185 190 



CCAGGTCATGAGGAATCGCTTGAACACTTTGlTCGTCOGAACCTrOGT 681 
Pro Cly His Glu Glu 5ar val Glu Glu Phe Val Arg Arg Asn Lau Gly 

195 200 205 

GAT GAG CTT TTT GAA CGC TTC ATA GAG CCT TTT TCT TCA OGC GTC TAT 729 
A^T Glu VAl Phe Glu Arg Lau He Glu Pro Phe cya Ser Gly Val Tyr 
210 215 220 225 



OCAOaCGATCCTTCAAAArEAAGTATGAAAOCAGCATTCGOG AAA OtT 111 
Ala Gly A«p Pro Sar Lys Leu Sar M<c Lya Ala Ala Pha Gly Lya Val 
230 235 240 

TGG AAC CTC CAA AAA AAT GOT GCt AGC ATT ATT OCT OGA ACT TTC AAA B25 

Trp Lya Leu Glu Lya asa cly Gly Scr Zla Zla cly Gly Thr Pha Lya 
245 250 255 

GCA ATA CAA GAG AGA AAT GCA GCT TCA AAA CCA CCT CGA GAT CCG CCT 873 
Al« Zla Gin Glu Arg Aan Gly Ala Sar Lya Pro Pro Arg Asp Pro Arg 

260 265 270 



CTG CCA AAA CCA AAA GCT CAG ACT GTT GCA TCT TTC CGS AM5 GGA CTT S31 
Lau Pro Ly» pro Lya cly Gin Thr Val Cly Sar Pha Arg Lys Gly Lau 
^ 275 2B0 2BS 

ACC ATC TTG CCT GAT GCA ATT TCT GCC AGA CTA GGC AAC AAA GTA AAC 96» 
Thr Hat Leu Pro Asp Ala Ila Sar Ala Arg Leu Gly Asa Lya Val Lya 

290 29S 300 305 

TTATCTTGCAACCTTTCAACTATTACTAAACTCGATACTCCAGAGTAC 1017 
Lau £ar Trp Lya Lau Sar Ser lie S«r Ly« LAU Asp Sar Gly Glu Tyr 

310 315 320 
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ACT TTG AC* TAT GAA ACA CCA GAA GGA GTO CfTf TCT TTC CAC TOC AAA IOCS 
Smr Leu Thr Tyr Clu Thr Pro Glu Cly v«l V»l S«r L«u Oln CV« Ly< 
32S 330 335 

ACT err etc CTC ACC ATT CCT TCC tat CTPT OCT act ACA TIC CtQ OC?r 1113 
Thr V4l v«l tM\x Thr lie Pro Smr Tyr val Al« Ser Thr L«u Leti Arg 
340 3^5 390 

CCT CTO TCT OCT CCT GCT OCA GAT GCA CTT TCA AAC TfT TAT TAC CCT 1161 
Pro L«u Scr Al« Al4 Ala AIa A«p Ala L«u S«r Ly« Phe Tyr Tyr Pro 

3SS 360 3«5 

CCACTTOCTGCACnTTCCATATCCTATCCAAAAGAAOCtATrACATCA 120d 
Pro V»l Al« Ala Val ser lie S«r Tyr Pro Lym 61u Ala lie Arg Smr 
370 375 3aO 385 

GAA TCC TPG ATA GAT GGT GA6 TTG AAG GOG TTT OtST CAA TtG CAT CCA 1357 
Glu Cym L«u Zlc A»p Gly Glu l*cu tyt Cly Phe C-ly ClR L*u «ia Pro 
390 3»S 400 

OOlAGCCAAGGAGTGGAAACATTAGCAACrATATACAGCTCATCACTA 1305 
Arg sar Glr Gly val Clu Thr Lau Gly Thr xla Tyr Sar sar L«u 
40S 410 41S 

TTC CCC AAC CCA CCA CCA CCT OCA AOS CTT CPA CTC TTC AAT TAC ATT 1353 
Pbe Pro Aan Arg Ala Pro pro Gly Arg Val Lau Lau Lau Aan Tyr ila 
4Jg 42S 430 

GGA GGA OCA ACT XxT ACT OGA ATT TTA TCG AAG ACG GAC ACTT GAA CTT 1401 
Cly Gly Ala Thr Aan Thr Cly Ha Lau sar Lya Thr Aap Smz Glu Imi 
43S 440 44S 

CTG GAA ACA GTT GAT CGA GAT TTC AGG AAA ATC CTT ATA AAC CCA AAT 1449 

Val Glu Thr v&l Aap Arg Aap Lau Arg Lyfl lie Lau lie Aan Pro Aan 
450 455 460 465 

GCCCAGGATCCAmGTACTGGOOCrCAGACTGTGGCCTCAAOCTATT 1497 
Aia Cln Ajp Pro Pha vol v»I Gly V«l Arg Lou Trp Pro Gin Ala Ila 
470 475 4a0 

CCA CAG TTC TTA GTT CGC CAT CTT GAT CTT CTA GAT GTT GCt AAA CCT 1545 
Pro Gin Fha Lau V«l Gly K^t Lau Aap Lau Lau Aap Val Ala Lya Ala 
495 4»0 455 

TCT ATr ACA AAT ACT CGC TIT GAA OCG CTC TTC CTT GCSC OCT AAT TAT 1593 
Ser lie Ary AAA Thr Cly Phe Glu Gly Lau Phe Leu Gly Gly Asn Tyr 
^00 SOS SIO 

crrc tcr GGT (jtr occ tto gga cga tgc gtt gag oga gcc tat gag GTA lfi41 
VaI sar Gly Val AIa Leu Gly Arg Cya Val Glu Gly Ala Tyr Giu Val 
SIS 520 S2S 

CCA GCT GAA CTA AAC GAT TTT CTC ACA AAT AGA GTC TAC AAA 1633 

Ala AIa Glu val Aan Ajp Pha Leu Thr Aan Arg val Tyr Lya 

530 53S 540 

TACTAOCACT TTTTCTTTTT CTCCT GG AAT GGC T G ATGCC ACTCTCGTCT TCCATTGAAT 1743 
TATAATAATC TGAAACTTTC TCA-'xATTCCT TCGATAOGTT ' TTTOGCGCCT TCTATTGCTC 1803 
. ATAATCTAAA ATCCTCTTTA ACTTTGAAAA AAAAAAAAAA AAAA 1^47 



(2) INFOAMATION FOR SSQ ZD NO: lit s 
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(1) flCQOciief CKxm^ntiusTTCfli 

(B) Ttra: tmino Acia 
(0) TOPCIiOor: lliMT 

(11) NOWDLl ms: protaln 

HAt Val 84r VaI fh* Aan Glu Zl« t«u Ph« ^ fro A«n GXn Thr £m 

IS 1& IS 

liAU Arg Fro S«r L«u Mi« 5«r Ptq Thr 9«r PK« flw Thr 9wt fro Iter 
30 2S 30 

ATff ty* PIw Pro Azy S«r Ar? >ro Asn »r» 7ll« t«u A^O Cy» Mr Il« 
35 40 45 

AIa Glu Glu S«r Thr Al« S#x Pro Pro lys thr Ar« A«p Mr Ala Pt« 

50 S5 «Q 

v*l Asp Cy9 VAl y«l VAl oly Cly Gly vaI 8«r oly L«u cy« Xl« Ala 
(5 70 75 10 

Gin AlA Lflfu Ala Thr Lya Kla Ala Aan Ala Aan Val val Val Thr Glu 

85 90 95 

Ala Arg A«p Ar« Val Gly Gly Aja Ha Thr thr Mac Glu Arg Aap Gly 

100 105 110 

tyr Leu Trp Glu Glu Gly Pro Aan Sax pha GIa Pro Sar hmp Pro )lat 
115 130 135 

Lau Thr H«c val Val Aap Set Gly Im Lyt Aap Glu Lau Val («u Gly 
130 135 140 

Aap pro Aap Ala Pro Ary Pha Val Lau Tzv Aan Arv Lya Lau Arg Pro 

145 150 155 140 

Val Pro Gly Ly« Leu Thr Aap Lau Pro Pfac Pha Aap tmi Mac Sar lla 

165 170 175 

Gly Gly Lya He Arg Ala Gly Pha Gly Ala Lau Gly Ila Arg Pro ^Pro 
180 185 190 

Pro Pre Gly Kia Glu Glu Sar Val Glu Glu Pha Val Arg Arg Aan Leu 

195 200 305 

Gly Aap Glu Val Pha Glu Arg Lau He Glu Pro Phc Cy« Sar Gly Val 

210 215 220 

Tyr Ala Gly Aap Pro Ser Lya Lau 5er Met Lya Ala Ala The Gly Lya 

225 230 335 240 

Val Trp Lya Lau Glu Lya Aan Gly Gly Sar Ha Ila Gly Gly Thr Phe 

245 250 255 

Lya Ala Ila Gin Clu Arg Aan Gly Ala Sar Lya Pro Pro Arg Aap Pro 

240 365 270 



Arg Lau Pro Lya Pro Ly« Gly Gin Thr Val Qly 5ar Pha Arg Lya Gly 
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290 




Leu 


pro Aa» Ala 

395 


:ia ftar Ala Art WMi 

300 


Oly Aan 


Lya vai 




Uu 


S«r 


Trp 


Lya Lau Aar 

310 


iar Xla ««r Lya Lau 

315 


Aap §0t 


oly olu 
320 


Tyr 


Mr 




Thr 


Tyr Qiu Thr 
325 


Pro OIu Oly Val Val 

330 


Sat Lau 


Gin cya 
335 


hr$ Thr 


Val 


Val 
340 


Lau Tltf Zla 


Pro Bar Tyr val Ala 

345 


iar fhr 

350 


Lau iM 


Arg 


Pro 


355 


ter 


Al« Ala AU 


AlA A«p Ala Lm Bar 

3(0 


Lya pha 

3«5 


T*-* Tyr 


Pro 


Pro 

370 


V«l 


Ma 


Ala val Sar 

37S 


Xla gar Tyr Pro Lya 
300 


Olu Ala XI* Arg 


S«r Clu 

38S 


cyi 


Lau 


114 Aap oly 

390 


Olu Lay Ly« aiy flia 

395 


Oly Olo 


imx Hia 

400 


Pro 


Arg 




Gin 


Gly Val Gltt 
40S 


Thr Lau Oly Ttir Xla 
410 


Tyr Mr 


Aar Jar 

415 




PtM 


Pro 


Asn 

420 


Aro Ala Pro 


pro Gly Arg val Lau 
435 


Lau Lag 
430 


Aa& Tyr 


Xl« oly 


Gly 
435 


Ala 


Thr A*n Tt»r 


Cly Xla Lau S«r Lya 
440 


fhr Afl> S4tr Glu 
44S 




450 


GlU 


Thr 


Val A«9 Arg 

455 


Aop Lau Arg Ly« II* 

460 


Lau Xla 


Aitt pro 


Asn 
45S 


AlS 


Gin 


MP 


Pro Pha val 

470 


Val Gly Val Arg Lau 
475 


Trp Pro 


cla Ala 
4B0 


Urn 


pro 


Gin 


Pt» 


iM Val Gly 
485 


Hia Laiu Aap Lau 
490 


Aap Val 


Ala Lya 
4»5 


Ala 


5«r 


lie 


Ar9 

500 


Aan Thr Gly 


Pha Gin Gly Lau Pha 

505 


Lau Gly Gly Aan 

510 


Tyr val 


Ser 
515 


Gly 


Val Ala Lau 


Gly Arg Cya Val Glu 

520 


Gly Ala 

525 


Tyr Glu 




Ala 

530 


Ala 


Glv 


Val Am Aap 

535 


Pha Lau Ttir Aan Arg 
540 


Val Tyr 


Lya 


(2) 


XNFORMATXON 


FOR SEO to NO: 13; 







i 



(i) SEQUENCE CKAHACTQIXSTZCS : 

(A) UMCTU: SB3 baa* pair* 
tB) TYPE: nuclaic acid 
(C) ST1Ue<D£CNES8; ai&gla 
(D} TDPOLOCrr: liMar 

<ii) HOLBCDX^ TYPE: SNA (ganaftic^ 

(iii) HTFOTWrXCAL; HO 



(iac) FEATORE; 

(A) KAKE/KEY: proiMcar 
(Bt LOCATIOtI: l..Se3 
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pr«#ot*r' 



txi) miui0i?« oyacRimoH; sig id K):iJ: 

OMtTCCmT CQMTTATAT AATTATeAtA JIArmUATA AOCATOnOC C I'FPJ ATTAA <0 

AOACOTrtAA TAAAomoO TAATAATOOA CTTTeACrfC MACTCQMr CTCA^AA'T 120 

TAATTAATAr TTACATCAAA ATTTOOTCAC TAATATTACC AAAl-fAATAT ACTAAAATCTT 190 

tAArrCOCAA ATAAAACACT AATFCCAAAT AAAOOTl^'r f AlQATAAAC A^lffATTOAA 340 

CTTOATAAAfl CAAAOCAAAA ATAATaotTTT TCAAOOTTTa OOTTATATAT GACAAAAAM 300 

AAAAAAOCTFT TaVPTATATA TCTAmoOC PTATAACCAT mA^ACAAA PfWj gCCTA 3«0 

ACTAAIATAA TAAAATAAAC 0TAAT07TCC fi ' l - nA TATT TWTICAAAC CCAACKTAA 430 

ACCCAAAOCA AAfOAAAAAOT ATAOOOTACC OTACACAOAC TT A T UW f Q YCRGAerTOCA 490 

oaTOAATATT T LTWlWll, AiLiLLil ' ltp TirT«AaAA OATTACCCAA TCTQAAMAA $40 

ACCAAOAAflC TCACAAAATT CCGAATTCTC TQCGATTTCC A7C »«3 
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The invcntiOAM described hcidn it mw i m ii |) li mi lo tnehMte fl>t fdtowim < 

AlllbudillNUIII- 

I. An iaolaed DWA mokCTic eftoodwg a pin p < um p u r p t iy ri»o^ <wid t< |ro<o t ) 4 
■ttected from the gnwp conititinf of i ioybev> prmos enzyme «id a w*ie« pvm e«ynie- 



2. The iiolatedDNA molecute of claim I cnoodinf Mtd 
amino acid lequence Kt forth in 5EQ ID No.t2. 



3. Tim 
10 Nail. 



iaoiatod DNA molecule of datm 2 oon^ffiitiii die I 



E Ml Ml M SfiQ ID 



4. Tlie i*oi«ted IX4A meteculc ofclinti K cncodmt Mid 
amino e^aequenCBjet forth in SEQ ID No, 10. 



15 5. TheisoUtedDNAmoiectileofcbim4comprmn(d« 
No. 9. 



Mr forth m SEQ ID 



6. A DNA molocute encoding a modified pratoporphyfi^^ 

precox wterein the cysiemeocconii^ at ihepondo^ i61 eCSEQID 

20 No. 6 U teplaced with another amino add, wherein said modiTied poiox i» lolovn lo a hnfoicide 
in amounts vrhkh inhibit aaid pUm praox. 



7. TTk DNA mokculc of claim 6 whcrtin said cysteine is repUced wiA a pteiy Wanxne. 



25 8- A DNA molecule encoding a modified protopoiphynnogen ovdasetj^tTiOa) c^n^isug a plant 
precox wherein the tsoleudne occumivg at the poaibod oomsponding to amino acid 421 of 5EQ 
ID Na 6 is replaced with another amino acid, whemin said modified pimox is lokfant to a 
herbicide in amounts which inhibit said plant pcotox. 



30 9. Tbe DNA molecuk of claim 8 wherein said isokucineb replaced wiA-ai^^ 
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10. A PNA ifiolecute cnoodwg ■ wwdifwd pn mj ^o qi i y f it to§m Mu irnH v ^on ) 
pl«ni pMDt KAvtfig 1 Tint ■nmo icid lubiUMion $ni t wond urnno ■eid «ta 

ft«id frm •mtno acid nitaitiiikNi having ihe property of eoofemfig iwstsice to ipr^Mi 
9 inhibitDr Bid 

lakl lecfMid mno acM liMHMiOO 1^ 
Gcnfencd by ftkirrm amino acid iub«tttytion. 

11. ThcDNAmokcukofcUiin lOwherTOiaklaaco«darnbioacM«**^^ 
10 po»iUooir4ecied from ihe group eomMfV of 

(i) the poudon corietpocKting to die icrino «( imniM 

(ii) the poaition Gorrraponding 10 Ac ttoieocitiie am 
(ni) the poaitkm c»nc^»ndmg to die prolifie « am^ 

(iv) the position corrtspondingtod)easparigirieaiaiinioadd425ofSEQn>N0.2;a^ 
IS (v}thcposiiioocoiTMpoixfingiothciytO*if»catamd» 





12. The DNArndccufc of cUhn IK wherein *Mdfint amino acklK^^^ 
selected from the group consisting of 

(a) die pofiitioocofitspofftding to the alanine at amino add 166of SEQIDNo.6; 
20 (b) the position corresponding to the glydnc at pQsitioa 1 67 of SEQ ID No. 6; 

(c) the position cont^wrKling to the tyrosine at amino acW 

(d) the position corre^poodrng to the cysteme at amino acid 161 of SEQ ID No. 6; Ji 

(e) the position comespooding to the isoieucine at amino arid 42 1 of SEQ ZD No. 6. 



25 



30 



11 The DNA molecule of claim 1 1 wherein said second amino add substitiition occvt m 
position con«^iOAdipg to t/wscriM it amino add 305 of SEQ DD NO. 2 and said Ite 
substitution oocun at a position selected from the gmip consisting of 

(a) t»w position corresponding to die alanine at vttinoaci^ 166 of SEQ ID Na 6; and 

(b) the position cortespooding to the tyfosine at amino acid 372 of SEQ ID No. 6. 
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10 



15 



20 



25 



14 TheDNAmolecuteoreUim 13 whmm 
«fnino icid 305 of SEQ ID NO. 2 ii rvplwed 



Mid MfiflC OCCltfVJAt M pOMtKW 



f lo 



15. TheDHAmolecolcofdifmll whmm uid lecond immo icid 

pcMitkNi oMieiporMiiiif totheihf«oninei(«mnoK^ 

icid Mbititulion ooctin tf t petition Miecied 

(I) the positkm concipoiHlifig 10 the limine « Bnw i66of SEQIDNo-thind 
(b) the position comspondinf to the tyrome at emmo acid 372 of 5E0 n> No. 6. 



16. Tte IMA molecule of cUim 15 wteein thKOfiine occvmof « tffee | 
co c itop o ndi ng to imino ^ 249 of SEQ IP NO. 2 It ie|rticed widi hi Mins ecid teleeiad ^rmi 
the froup consisting of iioleocine ind alanine. 

17. The DNAnvtecukoTcUimllwhemnseidvoeondtfnino acid 

position oorresponcfing to the proline «t amino add 1 1 B of SEQ ID NO. 2 and said fim mmo 
acid substitution oecun it a position selected from the poup consisting of 

(a) the posiUoncofre^MOdingio the alanine at ammo acid 166 of SEQ ID Nol 6: and 
(h) the position corrcspoodiAg u> die tymme at amino acid 372 of SEQ ID No. 6. 

18. The DNA molecuk of claim 17 whei^ said pioline occitning at dte portion c ftiesp at din g 
to amino acid 1 IB of SEQ ID NO. 2 is replaced widi a leucine. 

19. The DNA molecule of claim U wheidn said second amino acid s ubstiDationoccun* the 
position corresponding lo the aspangine at amino acid 425 of SEQ ID NO. 2 and said fust amino 
acid substitution occuk ai a position selected from the group consisting of 

(a) the position cort^sponding to the alanine it amino acid l6iSof SEQ ID No. 6; and 

(b) tlie position corresponding lo the tyrosine at amino acid 372 of SEQ ID No. 6. 



. 20..The DNA mokcule of cUim 1 9 wherein said aspar^gine occurring at the position 
30 corresponding lo amino acid 425 of SEQ ID NO, 2 is replaced widi a serine. 
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21* TheDNAmoieeuleordiiffl 11 whmni uM mond mimio acid i 

pQtiUon corresponding CD ifae tyrotine M jmino 

ftcid lubititiitkin occttfi tl i poiitkifi tekcted from the frwp etm» mk n t of 

(i)lhepoMtMncofRspof^iii|lO(heiianineil«m^ l66c/SBQIDNo. 6; j 
(b) the pothkm comsponding ^ the tynmm m vmo mii 172 ct SEQ ID Ho, 6. 



10 



22. ThcONA motaileof ctihn21 whe<^ said CTreetne oopirmg « rt« 
to imiBO acid 498 of SEQ ID NO. 2 is rcplioed « cysMmi. 



23. The DHA rooiecuie of my of elaimt 13-23 whodn ukl tyioiifie iMxw tm g M tht pOMtmi 
oonttpondtng to BTiiiio aeid 372 of SEQ IDNo.6ititplaDed wMianamiAoieid 
the eroup consisting of cysteine, isoteucoie, leucine^ threon'me apd 




IS 24. The DNA mttoulc of cUim 12 wherein saki alanine occumnf at the ] 

to residue 1 66 of SEQ ID No. 6 is replaced widt aa ammo acid selected from the growp conaistiDg 
of vtline. threonine, leucine, cysteine and tyiDsine. 



25. The DNA molecule of claim 12 wheiein said glycine eccuniRg at de 
20 to re&iduc 167 of SEQ ID No. 6 is replaced with i mine. 



Gorfespendoif 



26. The DNA motecule of claim 12 wherein saki tfycmeooctaring it the poaition co frcapoo dii tg 
CO residue 167 of SEQ ID No. 6 is repfatted with a serine. 

2% 27. The DNA molecule of claim 12 wherein «aid cysteine occurring sr the position oonespofKling 
^ to residue 161 of SEQ ID No. 6 is replaced with a phenyUlanine 

2£. The DNA molccuk of claim 12 wherein said isolcucine occurring at thepositioo 
^ corresponding to residue 42 1 of SEQ ID No. 6 is replaced widi a tfueonine. 



R:624 



1 



IS 



29. Tlw DNA moteCttk of claim 10 whtmn Mid pM is 
muw. wh» loyhew ind >*/»W(topr<#. 



30 The DNA tiwIecufcrfcUitn 10,whefwnttidplirtpfCtoicompnw««^ 
s |fttactod(romthetrt«pcooMitin|orSEQIDN«.2,4,6 

31. Achtmcrkfeiiecoiiipri«n««pionK^ 
DNA inokCTk tftcodmg • prowp^ 



31 ThechiiwfkgciieofclMm3li*tek^ 
iiid DNA motocuk »*erftn «W »^ 
Mid DNA molecuk into the dUonapUsL 

33. The chinitfkr gew of claira 3t •ddiw^ 

said DNA molecule tnto the miioehondriL 



J6 34. A chimeric gene comprising »pfoinQterwhi^ 
molecule of claitn tO. 

35. The chiiT«ric gene of claim 34 ^fclhiooallypompTO 

uid DNA motecttk, whcreift said signal sequence is capaWe of uigeting the pwicin encoded by 
3S said DNA molecule into the chloroplasL 

^ 36. The Mwmc gene of claim 34 additionally ccmprtsing a signal sequence opmNy linked to 
said DN A mokoik; wbcrtin said signal iequcncc is capable of taigeting the pmeso encoded by 
said DNA moiccute into the mitochondria. 



30 
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37. A ncofiMnitit vectof conipnttiif thfi chffvi^^ 
I cipAle of bemg tably (fnatom w J hup > bote cdl. 



38. Ai«OKmibmmv«cioroim)pri>higthfid)tf^^ 
5 cApiMcfbdiitfUblytrmfor^ 



39. AiiModtuiMycrmfonnedwiftdwvvctOT 
expftsuof Mid DN A molocttk, 

I 

I jMitccllmiininaflacell, 

41, AptimcomprishtfiheZMAmotecukofd^ IO,wtaatini«dDNAfl»landei»eqniMd 
m «aid pUm tnd ccNkfm upon ttkl plittt ft^^ 
as natuTAlly occumng procox activity. 

41 Theplamofctaim41 whrnrnwdDNAmoiccukve^M 
oecuntog pracox coding sequence, 

20 43- A pltm comprising die chsimk gate of ciai^ 

said plinc tolenooe cd « heibicidc in amoums whkb inhibit natttnily occuring pmox activky. 

44. The plant of claim 4 L wherein said pliott is setetedffomte 
wheat, sorghum, rye. om. turf grass, noe. soybean, cottom loboxo, sugar beet and oilseed 

25 

45. A method for controlling the growth of uodesired vegetation which contprt^ Wlyi'g to a 
^ p<n>uUtionoftheplantofclaim4I «aeffcctivemoimtofipcotox-i^^ 

46. ThemethodofcUim45wheftinsjudplamisselecfedfn^ aoybeio, 
30 ~ cottoli. tobacco, sugar beet, oilseed tape, inaixe, wheat, soigku^ 




'03f07^14B(|)H!|22j} )bI:B S K B 



R:624 



P. 85 



10 



phenylpyitoJidino- «nd pi|«ridil«rt«i-F •-te^ ^-M P»««Wl-- 

< (FonmiltV) 

wteicinQequilB 



48. Tte 




15 



■ — O OR 



o 



(Formula VT) 



o OR H- OR 

(Ftorawlivn) (Fprnrntavni) 



(FonrnikK) 



OR OR 

(Formula DCa) (FbniiutaKb) 
wh«in R. e^-aJs H, a or F. R. a «w! R, I. « CT^ly 

20 hetcfocydic rinr 



49. TheiT^thodofGUim48 



whc<rin Slid imi<k is selected ftom ihc 



81 



^ (PoftmiUXV): 



COOR (FcmnuU 5CVII) 
wtmia R signifies (CM-tlkcnykixy)crtai>l-Ci.r«tt^ 



&3 



'^mim{^]imn ji.-B skb 




-.Hi p. 



50. The RMfaod of ctftbn 45 whoem 



10 



15 




(FbrnmUXVai), 



° CH, 



(FonnuUXX),M<i 




.c» 



(FonwibXXIX 



.•<•,', , . .. 




9 The pfCMK ffvcniMM pny^^Wd ficwl DMA 

MfllkA#^Brfta^^MBmA^HAdk^^^K ^^'J^^^^ ^^H^^^^^^^M f^^^i^^ 

pnHOpiXpnjTWQIpsn OKMIW (pvlO»/ CliiJIlBI IIOTI 

invcniioo lonchci modUiod fanwcf ths pMCcAfltfyiiw wMcii 
csxpittiiin the hcrtiicidf mlwii piotoE cmywrt t fc i^h i intm liip pio^idpA 
my be cA^iMNffid for itiittnco id pfuoi tnhibiion vitf wnunoti of riv 
10 Rf ixttiU form or ihcy nMy cwiifonficd with ■ 
pianipraioK cKzymc 



